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Abstract — This paper focuses on the 1-to-K broadcast packet 
erasure channel (PEC), which is a generalization of the broadcast 
binary erasure channel from the binary symbol to that of 
arbitrary finite fields GF(g) with sufficiently large q. We consider 
the setting in which the source node has instant feedback of the 
channel outputs of the A receivers after each transmission. The 
capacity region of the 1-to-A' PEC with COF was previously 
known only for the case A' = 2. Such a setting directly models 
network coded packet transmission in the downlink direction 
with integrated feedback mechanisms (such as Automatic Repeat 
reQuest (ARQ)). 

The main results of this paper are: (i) The capacity region 
for general l-to-3 broadcast PECs, and (ii) The capacity region 
for two types of 1-to-A' broadcast PECs: the symmetric PECs, 
and the spatially independent PECs with one-sided fairness con- 
straints. This paper also develops (iii) A pair of outer and inner 
bounds of the capacity region for arbitrary 1-to-A broadcast 
PECs, which can be easily evaluated by any linear programming 
solver. The proposed inner bound is proven by a new class of 
intersession network coding schemes, termed the packet evolution 
schemes, which is based on the concept of code alignment in GF(g) 
that is in parallel with the interference alignment techniques for 
the Euclidean space. Extensive numerical experiments show that 
the outer and inner bounds meet for almost all broadcast PECs 
encountered in practical scenarios and thus effectively bracket 
the capacity of general 1-to-A broadcast PECs with COF. 

Index Terms — Network coding, packet erasure channels, 
broadcast capacity, channel output feedback, network code align- 
ment. 

I. Introduction 

Broadcast channels have been actively studied since the in- 
ception of network information theory. Although the broadcast 
capacity region remains unknown for general channel models, 
significant progress has been made in various sub-directions 
(see [5'J for a tutorial paper), including but not limited to the 
degraded broadcast channel models J2], the 2-user capacity 
with degraded message sets W2\ or with message side infor- 
mation 1241 . Motivated by wireless broadcast communications, 
the Gaussian broadcast channel (GBC) [231 is among the most 
widely studied broadcast channel models. 

In the last decade, the new network coding concept has 
emerged 1161 . which focuses on achieving the capacity of a 
communication network. More explicitly, the network-coding- 
based approaches generally model each hop of a packet-based 
communication network by a packet erasure channel (PEC) 
instead of the classic Gaussian channel. Such simple abstrac- 
tion allows us to explore the information-theoretic capacity 
of a much larger network with mathematical rigor and also 



sheds new insights on the network effects of a communication 
system. One such example is that when all destinations are 
interested in the same set of packets, the capacity of any arbi- 
trarily large, multi-hop PEC network can be characterized by 
the corresponding min-cut/max-flow values [6), lfl6l . Another 
example is the broadcast channel capacity with message side 
information. Unlike the existing GBC-based results that are 
limited to the simplest 2-user scenario 1241 . the capacity region 
for 1-to-A' broadcast PECs with message side information has 
been derived for K = 3 and tightly bounded for general K 
values ETI . J221F1 In addition to providing new insights on 
network communications, this simple PEC -based abstraction 
in network coding also accelerates the transition from theory 
to practice. Many of the capacity-achieving network codes ||T0| 
have since been implemented for either the wireline [4] or the 
wireless multi-hop networks ifTTI . [T31 . 

Motivated by the state-of-the-art wireless network cod- 
ing protocols and the corresponding applications, this paper 
studies the memoryless l-to-A' broadcast PEC with Channel 
Output Feedback (COF). Namely, a single source node sends 
out a stream of packets wirelessly, which carries information of 
K independent downlink data sessions, one for each receiver 
dk, k = l,--' ,K, respectively. Due to the randomness 
of the underlying wireless channel condition, which varies 
independently for each time slot, each transmitted packet 
may or may not be heard by a receiver dk- After packet 
transmission, each dk then informs the source its own channel 
output by sending back the ACKnowledgement (ACK) packets 
periodically (batch feedback) or after each time slot (per- 
packet instant feedback) l25l . [9) derives the capacity region 
of the memoryless l-to-2 broadcast PEC with COF. The 
results show that COF strictly improves the capacity of the 
memoryless l-to-2 broadcast PEC, which is in sharp contrast 
with the classic result that feedback does not increase the 
capacity for any memoryless 1-to-l channel. J9) can also 
be viewed as a mirroring result to the achievability results 
of GBCs with COF (T8). It is worth noting that other than 
increasing the achievable throughput, COF can also be used for 
queue and delay management IfTTI . l20l and for rate-control 
in a wireless network coded system [FPU . 

The main contribution of this work includes: (i) The capac- 
ity region for general l-to-3 broadcast PECs with COF; (ii) 

T h e re sults of 1-to-i^ broadcast PECs with message side informa tion 
1211 . 1221 is related to the capacity of the "XOR-in-the-air" scheme II II 
in a wireless network. 



The capacity region for two types of 1-to-K broadcast PECs 
with COF: the symmetric PECs, and the spatially independent 
PECs with one-sided fairness constraints; and (iii) A pair of 
outer and inner bounds of the capacity region for general 1 -to- 
A broadcast PECs with COF, which can be easily evaluated 
by any linear programming solver. Extensive numerical exper- 
iments show that the outer and inner bounds meet for almost 
all broadcast PECs encountered in practical scenarios and thus 
effectively bracket the exact capacity region. 

The capacity outer bound in this paper is derived by 
generalizing the degraded channel argument first proposed in 
|[l"8l . For the achievability part of (i), (ii), and (iii), we devise 
a new class of inter-session network coded schemes, termed 
the packet evolution method. The packet evolution method is 
based on a novel concept of network code alignment, which 
is the PEC-counterpart of the interference alignment method 
originally proposed for Gaussian interference channels J3), J7)- 
It is worth noting that in addition to the random PEC model 
in this paper, there are other promising channel models that 
also greatly facilitate capacity analysis for larger networks. 
One such example is the deterministic wireless channel model 
proposed in (TJ, which can also be viewed as a deterministic 
degraded binary erasure channel. 

The rest of this paper is organized as follows. Section HI] 
contains the basic setting as well as the detailed comparison to 
the existing results in (9), 03], |fl9l via an illustrating example. 
Section [III] describes the main theorems of this paper and the 
proof of the converse theorem. In particular, Section IIII-AI 
focuses on the capacity results for arbitrary broadcast PEC 
parameters while Section HlI-BI considers two special types of 
broadcast PECs: the symmetric and the spatially independent 
PECs, respectively. Section [IV] introduces a new class of 
network coding schemes, termed the packet evolution (PE) 
method. Based on the PE method, Section |V] outlines the 
proofs of the achievability results in Section UTU Some theo- 
retic implications and discussions are included in Section |VT] 
Section IVHI concludes this paper. 

II. Problem Setting & Existing Results 
A. The Memoryless 1-to-K Broadcast Packet Erasure Channel 

For any positive integer A, we use [A] = {1, 2, • • • , A} to 
denote the set of integers from 1 to K, and use 2^1 to denote 
the collection of all subsets of [A]. 

Consider a 1-to-K broadcast PEC from a single source s 
to K destinations dk, k £ [K]. For each channel usage, the 
1-to-A" broadcast PEC takes an input symbol Y £ GF(g) from 

s and outputs a A'-dimensional vector Z = {Z\, ■ ■ ■ , Zk) £ 
({Y}U{*}) K , where the fc-th coordinate Zk being "*" denotes 
that the transmitted symbol Y does not reach the fc-th receiver 
dk (thus being erased). We also assume that there is no other 
type of noise, i.e., the individual output is either equal to the 
input Y or an erasure "*." The success probabilities of a 1-to- 
K PEC are described by 2 K non-negative parameters: P S ^ K ^ S 

for all S £ 2^ such that ^2 S€2 i K ] Ps TkKs = m ^ f° r a ^ 



y e GF( ? ), 



Prob ( {k £ [K] :Z k = y} = S\Y = y)= p,^. 

That is, Pg TjHTs denotes the probability that the transmitted 
symbol Y is received by and only by the receivers {dk : 
k £ S}. In addition to the joint probability mass function 
Ps [K]\s °f t ' le success events, the following notation will be 
used frequently in this work. For all S 6 2^', we define 



PUS = PS>[K]W- 



(1) 



That is, p u s is the probability that at least one of the receiver 
dk in S successfully receives the transmitted symbol Y, For 
example, when K = 2, 

PU{1,2} =P{1}{2} +P{2}{!} + P{1,2}$ 

is the probability that at least one of d% and d 2 receives the 
transmitted symbol Y, We sometimes use pk as shorthand for 
Pu{k}, which is the marginal probability that the k-th receiver 
dk receives Y successfully. 

We can repeatedly use the channel for n time slots and let 
Y(t) and Z(i) denote the input and output for the i-th time 
slot. We assume that the 1-to-A broadcast PEC is memoryless 
and time-invariant, i.e., for any given function y(-) : [n] i-)- 
GF(q), 

Prob (Vt G [n], {k : Z k (t) = y(t)} = S(t) 

n 

\Vt£[n},Y(t)=y(t)) = l[ P 



t=i 



S(t)[K]\S(t) 



Note that this setting allows the success events among different 
receivers to be dependent, also defined as spatial dependence. 
For example, when two logical receivers d^ and dk 2 are 
situated in the same physical node, we simply set the P s jjq\s 
parameters to allow perfect correlation between the success 
events of dk 1 and dk 2 . Throughout this paper, we consider 
memoryless 1-to-A' broadcast PECs that may or may not be 
spatially dependent. 

B. Broadcast PEC Capacity with Channel Output Feedback 

We consider the following broadcast scenario from s to 
{dk ■ k £ [K]}. Assume slotted transmission. Source s is 
allowed to use the 1-to-A' PEC exactly n times and would 
like to carry information for K independent downlink data 
sessions, one for each dk, respectively. For each k £ [A], the 
k-th session (from s to dk) contains nRk information symbols 
Xfe = {Xkj £ GF(q),j £ [nRk]}, where Rk is the data rate 
for the (s,dk) session. All the information symbols Xkj for 
all k £ [A] and j £ [nRk] are independently and uniformly 
distributed in GF(g). 

We consider the setting with instant channel output feedback 
(COF). That is, for the t-th time slot, source s sends out a 
symbol 

Y(t) = f t ({X k : Vk £ [A]}, {Z(r) : r G [t - 1]}) , 
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Fig. 1. Illustration of a 1-to-ii" broadcast PEC with COF. 



which is a function f t ( ) based on the information symbols 
{X kJ } and the COF {Z(r) : r £ [t - 1]} of the previous 
transmissions. In the end of the rc-th time slot, each d k outputs 
the decoded symbols 



3 e[nR k ]} = g k ({Z k (t):Vte[n]}), 



where g k (-) is the decoding function of d k based on the corre- 
sponding observation Z k (t) for i £ [n]. Note that we assume 
that the PEC channel parameters vPgp^S : ^ e 2l A 'l j are 
available at s before transmission. See Fig. Q] for illustration. 

We now define the achievable rate of a 1-to-A' broadcast 
PEC with COF. 

Definition 1: A rate vector (R\, ■ ■ ■ ,Rk) is achievable if 
for any e > 0, there exist sufficiently large n and sufficiently 
large underlying finite field GF(q) such that 

Vfce [A'], ProbfXfe^X 



< e. 

Definition 2: The capacity region of a \-Xo-K broadcast 
PEC with COF is defined as the closure of all achievable rate 
vectors , ■ • • , Rk )■ 

C. Existing Results 

The capacity of l-to-2 broadcast PECs with COF has been 
characterized in |9l : 

Theorem 1 (Theorem 3 in /j9^J: The capacity region 
(Ri,R 2 ) of a l-to-2 broadcast PEC with COF is described 
by 

«»_ < 1 

(2) 




PU{1,2} — 
.PU{1,2} P2 — 

One scheme that achieves the above capacity region in (01 
is the 2-phase approach in |9). That is, for any (Ri,R 2 ) in 
the interior of (O, perform the following coding operations. 

In Phase 1, the source s sends out uncoded information 
packets Xij 1 and X 2 j 2 for all ji £ [nR\] and j 2 G [nR 2 ] 
until each packet is received by at least one receiver. Those 
Xi j x packets that are received by d\ have already reached 
their intended receiver and thus will not be retransmitted in 
the second phase. Those X\ t j 1 packets that are received by d 2 



0^ 



(a) Sending the first Phase-2 packet [X\ + X2]. 
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(b) The optimal coding operation after sending the [X\ + X2]. 
Fig. 2. Example of the suboptimality of the 2-phase approach. 



but not by d\ need to be retransmitted in the second phase, 
and are thus stored in a separate queue Q 1 . 2 j- Symmetrically, 
the X 2 j 2 packets that are received by d\ but not by d 2 
need to be retransmitted, and are stored in another queue 
<5 2 i2- Since those "overheard" packets in queues Q 1 . 2 j and 
Q 2 . 12 are perfect candidates for intersession network coding 
flU . they can be linearly mixed together in Phase 2. Each 
single coded packet in Phase 2 can now serve both d\ and 
d 2 simultaneously. The intersession network coding gain in 
Phase 2 allows us to achieve the capacity region in ©. 

Based on the same logic, [15] derives an achievability region 
for \-to-K broadcast PECs with COF under a perfectly sym- 
metric setting. The main idea can be viewed as an extension of 
the above 2-phase approach. That is, for Phase 1, the source s 
sends out all X k ,j, Vfc 6 [K],j £ [nR k ], until each of them is 
received by at least one of the receivers {d k : k £ [K]}. Those 
X k j packets that are received by d k have already reached their 
intended destination and will not be transmitted in Phase 2. 
Those X k j packets that are received by some other di but not 
by d k are the "overheard packets," and could potentially be 
mixed with packets of the z-th session. In Phase 2, source 
s takes advantage of all the coding opportunities created 
in Phase 1 and mixes the packets of different sessions to 
capitalize the network coding gain. fl9l implements such 2- 
phase approach while taking into account of various practical 
considerations, such as time-out and network synchronization. 

D. The Suboptimality of The 2-Phase Approach 

Although being throughput optimal for the simplest K = 2 
case, the above 2-phase approach does not achieve the capacity 
for the cases in which K > 2. To illustrate this point, consider 
the example in Fig. |2] 

In Fig. |2(a)| source s would like to serve three receivers 
d± to d^. Each (s,d k ) session contains a single information 
packet X k , and the goal is to convey each X k to the intended 
receiver d k for all k — 1,2,3. Suppose the 2-phase approach 



in Section Hl-CI is used. During Phase 1, each packet is sent 
repeatedly until it is received by at least one receiver, which 
either conveys the packet to the intended receiver or creates 
an overheard packet that can be used in Phase 2. Suppose 
after Phase 1, d\ has received X 2 and X3, d 2 has received 
X\ and X3, and ^3 has not received any packet (Fig. |2(a)| . 
Since each packet has reached at least one receiver, source s 
moves to Phase 2. 

One can easily check that if s sends out a coded packet 
[Xi + X 2 ] in Phase 2, such packet can serve both d\ and 
d 2 . That is, d\ (resp. d 2 ) can decode Xi (resp. X 2 ) by 
subtracting X 2 (resp. X{) from [X1+X2]. Nonetheless, since 
the broadcast PEC is random, the coded packet [X\ + X 2 ] 
may or may not reach d\ or d 2 . Suppose that due to random 
channel realization, [X\ + X 2 ] reaches only d%, see Fig. |2(a)| 
The remaining question is what s should send for the next 
time slot. For the following, we compare the existing 2-phase 
approach and a new optimal decision. 

The existing 2-phase approach: We first note that since 
g?3 received neither X\ nor X 2 in the past, the newly received 
\X\ + X 2 ] cannot be used by ti 3 to decode any information 
packet. In the existing results J9), US), lH9l , d 3 thus discards 
the overheard [X\ +X^\, and s would continue sending [X\ + 
X 2 ] for the next time slot in order to capitalize this coding 
opportunity created in Phase 1. 

The optimal decision: It turns out that the broadcast system 
can actually benefit from the fact that d% overhears the coded 
packet [X\ + X 2 ] even though neither X\ nor X 2 can be 
decoded by tij. More explicitly, instead of sending [X\ +X 2 ], 
s should send a new packet [Xi + X 2 + X3] that mixes all 
three sessions together. With the new [X\ + X 2 + X3} (see 
Fig. |2(b)| for illustration), d\ can decode the desired X\ by 
subtracting both X 2 and X3 from [X\ + X 2 + X3]. d 2 can 
decode the desired X 2 by subtracting both X% and X3 from 
[X\ + X 2 + X3]. For d^, even though d$ does not know the 
values of X\ and X 2 , d% can still use the previously overheard 
[Xi + X 2 ] packet to subtract the interference (X\ + X 2 ) from 
[X\ +X 2 + X3] and decode its desired packet X3. As a result, 
the new coded packet [X\ + X 2 + X3] serves all destinations 
di, d 2 , and 1^3, simultaneously. This new coding decision thus 
strictly outperforms the existing 2-phase approach. 

Two critical observations can be made for this example. 
First of all, when 1^3 overhears a coded [X\ + X 2 ] packet, 
even though ^3 can decode neither X\ nor X 2 , such new 
side information can still be used for future decoding. More 
explicitly, as long as s sends packets that are of the form 
a{Xi + X 2 ) + PX 3 , the "aligned interference" a(Xi + X 2 ) 
can be completely removed by Gfa without decoding individual 
Xi and X 2 . This technique is thus termed "code alignment," 
which is in parallel with the interference alignment method 
used in Gaussian interference channels (3 j. Second of all, in 
the existing 2-phase approach, Phase 1 has the dual roles of 
sending uncoded packets to their intended receivers, and, at the 
same time, creating new coding opportunities (the overheard 
packets) for Phase 2. It turns out that this dual-purpose Phase- 
1 operation is indeed optimal (as will be seen in Sections ITVl 
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Fig. 3. Illustration of the proof of Proposition [T] 



and|V]i. The suboptimality of the 2-phase approach for K > 2 
is actually caused by the Phase-2 operation, in which source s 
only capitalizes the coding opportunities created in Phase 1 but 
does not create any new coding opportunities for subsequent 
packet mixing. One can thus envision that for the cases K > 2, 
an optimal policy should be a multi-phase policy, say an M- 
phase policy, such that for all i £ [M — 1] (not only for the 
first phase) the packets sent in the i-th phase have dual roles 
of sending the information packets to their intended receivers 
and simultaneously creating new coding opportunities for the 
subsequent Phases (i + 1) to M. These two observations will 
be the building blocks of our achievability results. 

III. The Main Results 

We have two groups of results: one is for general l-to-i<f 
broadcast PECs with arbitrary values of the PEC parameters, 
and the other is for \-lo-K broadcast PECs with some restric- 
tive conditions on the values of the PEC parameters. 

A. Capacity Results For General 1-to-K Broadcast PECs 

We define any bijective function 7r : [K] n> [K] as 
a A'-permutation and we sometimes just say that n is a 
permutation whenever it is clear from the context that we are 
focusing on [K]. There are totally K\ distinct if -permutations. 
Given any X-permutation it, for all j G [K] we define 
SJ = {7r(Z) : yi 6 [j]} as the set of the first j elements 
according to the permutation ir. We then have the following 
capacity outer bound for any 1-to-K broadcast PEC with COF. 

Proposition 1: Recall the definition of pus m O- Any 
achievable rates (Ri, ■ ■ ■ , Rk) must satisfy the following K\ 
inequalities: 



K 



Pus? 



(3) 



Proof: Proposition[T]can be proven by a simple extension 
of the outer bound arguments used in (9), lfT8l . (Note that when 
K = 2, Proposition [TJ collapses to Theorem 3 of |9l.) 

For any given permutation ir, consider a new broadcast 
channel with (K — 1) artificially created information pipes 
connecting all the receivers di to dx- More explicitly, for all 
j G [if — 1], create an auxiliary pipe from d^fj\ to d^tj+xy See 



Fig.[3]for illustration. With the auxiliary pipes, any destination 
d-K{j), j £ [K], not only observes the corresponding output 



of the broadcast PEC but also has all the information 



Zn(i) of its "upstream receivers" for all / G [j — 1]. Since 
we only create new pipes, any achievable rates of the original 
1-to-K broadcast PEC with COF must also be achievable 
in the new 1-to-A' broadcast PEC with COF in Fig. [3] The 
capacity of the new l-to-K broadcast PEC with COF is thus an 
outer bound on the capacity of the original l-to-K broadcast 
PEC with COF. 

On the other hand, the new l-to-K broadcast PEC in Fig. [3] 
is a physically degraded broadcast channel with the new 
success probability of d k being Rjsj instead of p^^) ( see 
Fig. |3J. [HD shows that COF does not increase the capacity 
of any physically degraded broadcast channel. Therefore the 
capacity of the new l-to-K broadcast PEC with COF is 
identical to the capacity of the new l-to-if broadcast PEC 
without COF. Since (f3]) is the capacity of the new l-to-K 
broadcast PEC without COF, (0 must be an outer bound of the 
capacity of the original l-to-K PEC with COF. By considering 
different permutation it, the proof of PropositionQ]is complete. 

■ 

For the following, we first provide the capacity results for 
general l-to-3 broadcast PECs. We then state an achievability 
inner bound for general l-to-K broadcast PECs with COF 
for arbitrary K values, which, together with the outer bound 
in Proposition Q] can effectively bracket the capacities for the 
cases in which K > 4. 

Proposition 2: For any parameter values 
{ps {i 2 3}\s : ^ S e 2< 1 ' 2 < 3 } j of a l-to-3 broadcast PEC, the 
capacity outer bound in Proposition [T] is indeed the capacity 
region of a l-to-3 broadcast PEC with COF. 

To state the capacity inner bound, we need to define an 
additional function: f p (ST), which takes an input ST of two 
disjoint sets S,T G 2^. More explicitly, we define f p (ST) as 
the probability that a packet Y, transmitted through the l-to-K 
PEC, is received by all those d{ with i e S and not received 
by any dj with j G T. For example, /p^A']^) = p S t K i\ S 
for all S G 2^1 For arbitrary disjoint S and T, we thus have 



f P (ST) ± 



E PSi[K]\Si- ( 4 ) 

VS 1 :SCS 1 ,TC([K]\S 1 ) 



We also say that a strict total ordering "-<" on 2^ is 
cardinality-compatible if 

VS 1 ,S 2 €2^ K \ \Si\ < \S 2 \ => Si ~< S 2 . (5) 

For example, for K = 3, the following strict total ordering 

-< {2} ~< {1} < {3} -< {1,2} -< {1,3} -< {2,3} < {1,2,3} 

is cardinality-compatible. 

Proposition 3: Fix any arbitrary cardinality-compatible, 
strict total ordering -<. For any general l-to-K broadcast PEC 
with COF, a rate vector ■ • • , Rk) can be achieved by a 
linear network code if there exist 2 K non-negative x variables, 



indexed by S E 2^: 

{x s > : VS E 2 M } 



(6) 



and K3 K 1 non-negative w variables, indexed by (fc; S — > T) 
satisfying T C S C {[K]\k): 

[w k , s ^ T > : Vfc G [K),VS,Te 2^, 

satisfying TCSC ([A']\fc)} , (7) 

such that jointly the following linear inequalities^ are satisfied: 

x s< 1 (8) 

VS:Se2l A 'l 

VT G 2 [A ' ] ,Vfc G T, 

XT > ^2 W k . S ^(T\k) (9) 

VS:(T\fc)CSC([_fs"]\fc) 

Vfc G [K], w k .^^ -p U [ K ] > Rk (10) 
VfcG [K],VSC([K]\k),S^$, 

^ WkiS^-T! J Pv{[K]\S) > 

E 



w k . s ^ Tl ■ f P (5\T!)([Jf]\S) 



VSi , Ti : such that 
Ti C Si C ([K]\k), 
Ti C S, S £ Si 



(ID 



Vfc G [K],S, T G 2^1 satisfying TCSC ([K]\k),T ^ S, 
( \ 



Wk-S->T + 



E 



V 



VTi C S : 
(TiU{fc}) -< (Tu{fe» 



Pu([K]\S) 



< 



E w k , Sl ^ T -f p ((S\T)([K]\S) 



VSi : Si -< S, 
TCSiC {[K]\k) 



E 



Wk-Si^T! ■ 



VSi . Ti : such that 
Ti C Si C ([K}\k), 
(Ti U{fe}) -< (Tu{fe», 
Ti C S, S g Si 



/„ ((5\r 1 )([if]\5) 



(12) 

Since Proposition [3] holds for any cardinality-compatible, 
strict total ordering -<. We can easily derive the following 
corollary: 

To distinguish different strict total orderings, we append 
a subscript / to -<. For example, -<i and -<2 correspond to 
two distinct strict total orderings. Overall, there are L = 



IlfcLo ( (fc)'J distinct strict tota l ordering -<u \fl G [L], that 
are cardinality-compatible. 

2 There are totally (1 + K2 K ~ 1 +K3 K ~ 1 ) inequalities. More explicitly, 
(8) describes one inequality. There are K2 K ~ 1 inequalities having the form 
ofj9). Th ere are totally KZ K ~^ inequalities having the form of one of H0\ . 
03}, and 02). For comparison, the outer bound in Proposition [factually has 
more inequalities asymptotically (K\ of them) than those in Proposition [3] 



Corollary 1: For any given cardinality-compatible strict to- 
tal ordering -</, we use A; to denote the collection of all 
(Ri, ■ ■ ■ , Rk) rate vectors satisfying Proposition [3] Then the 
convex hull of Co ({A; : VZ G [L]}) is an achievable region of 
the given 1-to-AT broadcast PEC with COF. 

Remark: For some general classes of PEC parameters, one 
can prove that the inner bound of Proposition [3] is indeed the 
capacity region for arbitrary K > 4 values. Two such classes 
are discussed in the next subsection. 

B. Capacity Results For Two Classes of 1-to-K Broadcast 
PECs 

We first provide the capacity results for symmetric broadcast 
PECs. 

Definition 3: A 1-to-A' broadcast PEC is symmetric if the 
channel parameters ^P s Jk]\s : ^ £ 2^ j satisfy 

VSi,S a G 2^ with |^| = |S 2 |, p SiWm = p SaMW 

That is, the success probability Pg T^jrg depends only on 
the size of S, and does not depend on which subset of receivers 
being considered. 

Proposition 4: For any symmetric 1-to-A' broadcast PEC 
with COF, the capacity outer bound in Proposition [T] is indeed 
the corresponding capacity region. 

The perfect channel symmetry condition in Proposition [4] 
may be a bit restrictive for real environments as most broadcast 
channels are non-symmetric. A more realistic setting is to al- 
low channel asymmetry while assuming spatial independence 
between different destinations 

Definition 4: A 1-to-A' broadcast PEC is spatially indepen- 
dent if the channel parameters |?>g [ g ]\g : VS 1 G 2^ \ satisfy 



VS G 2^ K \ p 



S[K\\S 




n (!-**) 

ke[K]\S 



where pk is the marginal success probability of destination dk- 
Note: A symmetric 1-to-AT broadcast PEC needs not be spa- 
tially independent. A spatially independent PEC is symmetric 

if Pi = P2 = ■ ■ ■ = Pk ■ 

To describe the capacity results for spatially independent 
1-to-AT PECs, we need the following additional definition. 

Definition 5: Consider a 1-to-AT broadcast PEC with 
marginal success probabilities p\ to pk- Without loss of 
generality, assume pi < P2 < • • • < Pk, which can be 
achieved by relabeling. We say a rate vector (Ri, ■ ■ ■ , Rk) 
is one-sidedly fair if 

<j, Ri(l-Pi) > JZ^l-pj). 

We use A os f to denote the collection of all one-sidedly fair 
rate vectors. 

The one-sided fairness contains many practical scenar- 
ios of interest. For example, the perfectly fair rate vector 
(R, R, ■ ■ ■ ,R) by definition is also one-sidedly fair. Another 
example is when min(pi,-- - ,pk) > \ and we allow the 
rate Rk to be proportional to the corresponding marginal 



success probability pk, i.e., Rk = pkR, then the rate vector 
(piR,P2R, ■ ■ ■ ,PkR) is also one-sidedly fair. 

For the following, we provide the capacity of spatially 
independent 1-to-AT PECs with COF under the condition of 
one-sided fairness. 

Proposition 5: Suppose the 1-to-A' PEC of interest is spa- 
tially independent with marginal success probabilities < 
Pi < P2 < • • • < Pk- Any one-sidedly fair rate vector 
(Ri,-- - ,Rk) G A os f is in the capacity region if and only 
if (Ri, ■ ■ ■ , Rk) G A 0S f satisfies 



K 

E 

k=l 



Rk 



— < 1. (13) 

nii(i-pi) ~ 

Proposition [5] implies that Proposition [T] is indeed the 
capacity region when focusing on the one-sidedly fair rate 
region A osf . 

IV. The Packet Evolution Schemes 

For the following, we describe a new class of coding 
schemes, termed the packet evolution (PE) scheme, which 
embodies the concept of code alignment and achieves (near) 
optimal throughput. The PE scheme is the building block of 
the capacity / achievability results in Section [III] 

A. Description Of The Packet Evolution Scheme 

The packet evolution scheme is described as follows. Recall 
that each (s,dk) session has nRk information packets Xk t i 
to Xk, n R k ■ We associate each of the X^fcLi n Rk informa- 
tion packets with an intersession coding vector v and a set 
S Q [AT]. An intersession coding vector is a (j2k=i n ^-kj- 
dimensional row vector with each coordinate being a scalar 
in GF(q). Before the start of the broadcast, for any k G [AT] 
and j G [nRk] we initialize the corresponding vector v of 
Xkj in a way that the only nonzero coordinate of v is the 
coordinate corresponding to Xkj and all other coordinates 
are zero. Without loss of generality, we set the value of the 
only non-zero coordinate to one. That is, initially the coding 
vectors v are set to the elementary basis vectors of the entire 
(^2^! nRkj -dimensional message space. 

For any k G [AT] and j G [nRk] the set S of Xk.j is 
initialized to 0. As will be clear shortly after, we call S the 
overhearing seQ of the packet Xkj- For easier reference, we 
use v(Xk.j) and S(Xk.j) to denote the intersession coding 
vector and the overhearing set of Xk.j- 

Throughout the n broadcast time slots, source s constantly 
updates the S(Xkj) and v(Xkj) according to the COF. The 
main structure of a packet evolution scheme can now be 
described as follows. 

§ The Packet Evolution Scheme 
1: Source s maintains a single flag f c hange- Initially, set 



change 



l. 



3 Unlike the existing results II II . in this work the overhearing set does 
not mean that the receivers di in S(Xi c j) have known the value of X^-j- 
Detailed discussion of the overhearing set S(Xk.j) are provided in Lemma|2] 



2: for t = 1, • • ■ ,n, do 

3: In the beginning of the t-th time slot, do Lines l4l to [TOl 



if Change = 1 then 



Choose a non-empty subset T C [K]. 
Run a subroutine PACKET SELECTION, which takes 
T as input and outputs a collection of \T\ packets 
{X k) j k : Vfc G T}, termed the target packets, for 
which all X ktjh satisfy (S(X ktjlt )U {k}) D T. 
7: Generate |T| uniformly random coefficients c k <E 
GF(q) for all k S T and construct an intersession 
coding vector v tx <- Y^keT c k • v {X kJk ). 

8: Set fchange ^ 0. 

9: end if 

10: Sends out a linearly intersession coded packet according 
to the coding vector v tx . That is, we send 



Y tx — v K • (-Xi,i, 



,X 



K,nR 



11: 

12: 

13 
14 
15 



where (Xi.i, ■ ■ ■ , Xk,uR k ) t is a column vector con- 
sisting of all information symbols 
In the end of the t-th time slot, use a subroutine 
Update to revise the v(X k j k ) and S(X k j k ) values 
of all target packets X k j k based on the COF. 
if the S(X k j k ) value changes for at least one target 
packet X k j k after the UPDATE then 

Set fchange ^ 1- 

end if 
end for 



In summary, a group of target packets {X k .j k } are selected 
according to the choice of the subset T, The corresponding 
vectors {v(Xfc.j fc )} are used to construct a coding vector v tx . 
The same coded packet Y tx , corresponding to v tx , is then sent 
repeatedly for many time slots until one of the target packets 
X k j k evolves (when the corresponding S(X k j k ) changes). 
Then a new subset T is chosen and the process is repeated 
until we use up all n time slots. Three subroutines are used 
as the building blocks of a packet evolution method: (i) How 
to choose the non-empty T C [K]; (ii) For each k 6 [K], 
how to select a single target packets X k .j k among all X k j 
satisfying (S(X k j) U {k}) D T; and (iii) How to update the 
coding vectors v(X k j k ) and the overhearing sets S(X k j k ). 
For the following, we first describe the detailed update rules. 

§ Update of S(X kJk ) and v(X ktjk ) 
1: Input: The T and v tx used for transmission in the current 
time slot; And S m , the set of destinations di which receive 
the transmitted coded packet in the current time slot. (S^ 
is obtained through the COF in the end of the current time 
slot.) 

4 It is critical to note that the coding operation is based purely on v tx rather 
than on the list of the target packets X k j k . Once v tx is decided, we create 
a new coded packet based on the coordinates of v tx . It is possible that v tx 
has non-zero coordinates corresponding to some Xy j that are not one of 
the target packets X k j k . Those Xy j will participate in creating the coded 
packet. 



2: for all k e T do 

3: if SW g S(X k<jh ) then 

4: Set S(X kjk ) <— (T n S(X kdk )) U S r; 

5: Set v(X k<jk ) <- Vtx. 

6: end if 

7: end for 



An Illustrative Example Of The PE Scheme: 

Let us revisit the optimal coding scheme of the example 
in Fig. |2] of Section III-DI Before broadcast, the three infor- 
mation packets X\ to X 3 have the corresponding v and S: 

v(Xt) = (1,0,0), v(X 2 ) = (0,1,0), and v(X 3 ) - (0,0,1), 
and S{Xi) = S(X 2 ) = S(X 3 ) = 0. We use the following 
table for summary. 



X x : (1,O,O),0 X 2 : (O,1,O),0 X 3 : (0,0,1) 



Consider a duration of 5 time slots. 

Slot 1: Suppose that s chooses T = {1}. Since (0U{1» 2 
T, Packet Selection outputs Xi. The coding vector v tx 
is thus a scaled version of v(Xi) = (1,0,0). Without loss of 
generality, we choose v tx = (1, 0, 0). Based on v tx , s transmits 
a packet 1X\ + 0X 2 + 0X 3 = X\. Suppose [Xi] is received 
by d 2 , i.e., S a = {2}. Then during UPDATE, S IX = {2} £ 
S(Xi) = 0. Update thus sets Spfi) = {2} and v(X 1 ) = 
Vtx = (1; 0, 0). The packet summary becomes 



Xv (1,0,0),{2} X 2 : (O,1,O),0 X 3 : (0,0,1), 



Slot 2: Suppose that s chooses T = {2}. Since (0U{2}) D 
T, Packet Selection outputs X 2 . The coding vector v tx is 
thus a scaled version of v(X 2 ) = (0,1,0). Without loss of 
generality, we choose v tx = (0,1,0) and accordingly [X 2 ] is 
sent. Suppose [X 2 ] is received by d\, i.e., S a = {1}. Since 
Srx. S(X 2 ), after UPDATE the packet summary becomes 



Xf. (1,0,0),{2} X 2 : (0,1,0),{1} X 3 : (0,0,1), 



Slot 3: Suppose that s chooses T = {3} and PACKET 
Selection outputs X 3 . The coding vector v tx is thus a scaled 
version of v(X3) = (0,0,1), and we choose v tx = (0,0,1). 
Accordingly [X 3 ] is sent. Suppose [X 3 ] is received by di and 
d 2 , i.e., S rx = {1, 2}. Then after UPDATE, the packet summary 
becomes 



X i: (1,0,0),{2} X 2 : (0,1,0),{1} X 3 : (0,0,1),{1, 2} 



Slot 4: Suppose that s chooses T = {1, 2}. Since (S(X 1 ) U 
{1}) D T and (S(X 2 ) U {2}) D T, PACKET SELECTION 
outputs {Xi, X 2 }. Vtx is thus a linear combination of v(Xi) = 
(1, 0, 0) and v(X 2 ) = (0, 1, 0). Without loss of generality, we 
choose Vtx = (1,1,0) and accordingly [X\ + X 2 ] is sent. 
Suppose [Xi + X 2 ] is received by d 3 , i.e., S rx = {3}. Then 
during Update, for X u S IX = {3} %\ S(X X ) = {2}. Update 
thus sets S{Xy) = {2,3} and \{X X ) = v tx = (1,1,0). For 
X 2 , S a = {3} ^ S(X 2 ) = {1}. Update thus sets S(X 2 ) = 
{1,3} and v(X 2 ) = v tx = (1,1,0). The packet summary 
becomes 



Xf. (1,1,0),{2,3} 


X 2 : (1,1,0),{1,3} 


X 3 : (0,0,1),{1,2} 





Slot 5: Suppose that s chooses T = {1, 2, 3}. By Line|6]of 
The Packet Evolution Scheme, the subroutine Packet 
Selection outputs {Xi, X 2 ,X 3 }. v tx is thus a Unear com- 
bination of v(Xx) = (1,1,0), v(X 2 ) = (1,1,0), and 
v(X 3 ) = (0, 0, 1), which is of the form a(X 1 + X 2 ) + j3X 3 . 
Note that the packet evolution scheme automatically achieves 
code alignment, which is the key component of the optimal 
coding policy in Section IH-DI Without loss of generality, we 
choose a = /3 = 1 and v tx = (1, 1, 1). Y tx = [Xi + X 2 + X 3 ] 
is sent accordingly. Suppose [X\ + X 2 + X 3 ] is received by 
{d x ,d 2 ,d 3 }, i.e., S IX = {1,2,3}. Then after UPDATE, the 
summary of the packets becomes 



X V . (1,1,1),{1,2,3} 


X 2 : (1,1,1),{1,2,3} 


X 3 : (1,1,1),{1,2,3} 





From the above step-by-step illustration, we see that the 
optimal coding policy in Section III-DI is a special case of a 
packet evolution scheme. 

B. Properties of A Packet Evolution Scheme 

We term the packet evolution (PE) scheme in Section IIV-AI 
a generic PE method since it does not depend on how to 
choose T and the target packets Xkj k and only requires the 
output of Packet Selection satisfying (S(Xk,j k )U {k}) D 
T, Mk G T. In this subsection, we state some key properties 
for any generic PE scheme. The intuition of the PE scheme is 
based on these key properties and will be discussed further in 
Section HV-Cl 

We first define the following notation for any linear network 
codes. (Note that the PE scheme is a linear network code.) 

Definition 6: Consider any linear network code. For any 
destination dk, each of the received packet Z k (t) can be 
represented by a vector Wfc(t), which is a (j2 k=1 nRkj- 
dimensional vector containing the coefficients used to generate 
Z k (t). That is, Z k (t) = Wfc (t) • (X M ,--- ,X K>nRK ) T . If 
Zk(t) is an erasure, we simply set Wfc(f) to be an all-zero 
vector. The knowledge space of destination d k in the end of 
time t is denoted by Slz,k(t), which is the linear span of w k (r), 
t <t. That is, fi Zifc (i) = span(w fe (r) : Vr G [t]). 

Definition 7: For any non-coded information packet 
Xkj, the corresponding intersession coding vector is a 

(y2k—i nRkj -dimensional vector with a single one in the 
corresponding coordinate and all other coordinates being zero. 
We use 5kj to denote such a delta vector. The message space 
of dk is then defined as Q,m,U = span(<5fcj : Vj G [nR k ]). 

With the above definitions, we have the following straight- 
forward lemma: 

Lemma 1: In the end of time t, destination d k is able to 
decode all the desired information packets X k j, Vj G [nR k ], 
if and only if Cl M ,k Q &z,k(t)- 

We now define "non-interfering vectors" from the perspec- 
tive of a destination dk- 



Definition 8: In the end of time t (or in the beginning of 
time (t + 1)), a vector v (and thus the corresponding coded 
packet) is "non-interfering" from the perspective of d k if 

v g span(f2x,fc(t),Q M ,fe)- 

We note that any non-interfering vector v can always 
be expressed as the sum of two vectors v' and w, where 
v' G f^M.fc is a linear combination of all information vectors 
for dk and w G £lz,k(t) is a linear combination of all the 
packets received by dk- If v' = 0, then v = w is a transparent 
packet from d k 's perspective since d k can compute the value 
of w-(Xi i, • • ■ , XK, n R K ) T from its current knowledge space 
&z,k(t)- If v' ^ 0, then v = v' + w can be viewed as 
a pure information packet v' G &M,k after subtracting the 
unwanted w vector. In either case, v is not interfering with 
the transmission of the (s,d k ) session, which gives the name 
of "non-interfering vectors." 

The following Lemmas|2]and[3]discuss the time dynamics of 
the PE scheme. To distinguish different time instants, we add 
a time subscript and use St-i(X k ,j k ) and St(X k ,j k ) to denote 
the overhearing set of X k j k in the end of time (t—1) and t, 
respectively. Similarly, v t -i(X k j k ) and v t (X k j k ) denote the 
coding vectors in the end of time (t — 1) and t, respectively. 

Lemma 2: In the end of the t-th time slot, consider any 
X k j out of all the information packets X\ t x to Xk.uR k - Its 
assigned vector v t (X k j) is non-interfering from the perspec- 
tive of di for all i G (S t (X k ,j) U {k}). 

To illustrate Lemma [2] consider our 5-time-slot example. 
In the end of Slot 4, we have v(X x ) = (1,1,0) and 
S(X{) U {1} = {1,2,3}. From dis perspective, 0^,i(4) = 
span((0, 1, 0), (0, 0, 1)) and Q M ,i = span((l, 0, 0)). v'(Xi) G 
span(0^.i(4), J7m,i) is indeed non-interfering from d±'s per- 
spective. The same reasoning can be applied to d 2 to show 
that v(Xl) is non-interfering from d 2 s perspective. For 
da, tl Zt3 (4) = span((l,l,0)) and fi M , 3 = span((0, 0, 1)). 
v(Xi) G span(il^.3(4), J7m,3) is indeed non-interfering from 
d 3 's perspective. Lemma |2] holds for our illustrative example. 

Lemma 3: In the end of the t-th time slot, we use fln }k (t) 
to denote the remaining space of the PE scheme: 

f2fl,fc(t) = 

span(v t (X fcji ) : Vj G [nR k ] satisfying k g S t (X k ,j))- 

For any n and any e > 0, there exists a sufficiently large 
finite field GF(q) such that for all k G [K] and t G [n], 

Prob (span(n^ i jfe(t), ft.R,k(t)) = span(0^ ife (t), Q M ,k)) 
> 1 - e. 

Intuitively, Lemma [3] says that if in the end of time 
t we directly transmit all the remaining coded packets 
{v t (X ktj ) : \/j G [nRk],k £ S t (X kJ )} from s to d k through a 
noise-free information pipe, then with high probability, d k can 
successfully decode all the desired information packets X ky i 
to X k:n R k (see Lemma [TJ by the knowledge space ftz,k(t) 
and the new information of the remaining space flR tk (t). 

Lemma [3] directly implies the following corollary. 



Corollary 2: For any n and any e > 0, there exists a 
sufficiently large finite field GF(g) such that the following 
statement holds. If in the end of the n-th time slot, all 
information packets Xk.j have S n (Xkj) 3 k, then 

Prob(Vfc, dk can decode all its desired {Xk.j}) > 1 — e. 

Proof: If in the end of the n-th time slot, all Xkj 
have S n {Xk,j) 3 k, then the corresponding Q,R t k(n) = {0} 
contains only the origin for all k G [K\. Therefore, Corollary |2] 
is simply a restatement of Lemmas Q] and [3] ■ 

To illustrate Corollary [2] consider our 5-time-slot example. 
In the end of Slot 5, since k G S(X k ) for all k e {1,2,3}, 
Corollary |2] guarantees that with high probability all dk can 
decode the desired Xk, which was first observed in the 
example of Section Hl-DI 

The proofs of Lemmas [2] and [3] are relegated to Appen- 
dices [A] and [B] respectively. 

C. The Intuitions Of The Packet Evolution Scheme 

Lemmas [2] and [3] are the key properties of a PE scheme. In 
this subsection, we discuss the corresponding intuitions. 

Receiving the information packet Xkj: Each informa- 
tion packet keeps a coding vector -v(Xkj). Whenever we 
would like to communicate Xkj to destination dk, instead of 
sending a non-coded packet Xkj directly, we send an inters- 
ession coded packet according to the coding vector \r(Xkj). 
Lemma [3] shows that if we send all the coded vectors v(Xkj) 
that have not been heard by dk (with k ^ S(Xk.j)) through 
a noise-free information pipe, then dk can indeed decode all 
the desired packets Xk,j with close-to-one probability. It also 
implies, although in an implicit way, that once a v(Xkj ) is 
heard by dk for some jo (therefore k G S(Xk,j )), there is no 
need to transmit this particular v(Xfc JO ) in the later time slots. 
Jointly, these two implications show that we can indeed use 
the coded packet v(Xk,j) as a substitute for Xkj without 
losing any information. In the broadest sense, we can say 
that dk receives a packet Xk.j if the corresponding v(Xkj) 
successfully arrives dk in some time slot t. 

For each Xkj, the set S(Xkj) serves two purposes: (i) 
Keep track of whether its intended destination dk has received 
this Xk j (through the v(Xk j)), and (ii) Keep track of whether 
v(Xk.j) is non-interfering to other destinations di, i ^ k. We 
discuss these two purposes separately. 

Tracking the reception of the intended dk- We first note 
that in the end of time 0, dk has not received any packet and 
we indeed have k ^ S(Xkj) — 0. We then notice that for 
any given Xk.j, the set S(Xkj) evolves over time. By Line|4] 
of the Update, we can prove that as time proceeds, the first 
time to such that k G S(Xk.j) must be the first time when 
Xkj is received by dk (i.e., Xk.j is chosen in the beginning 
of time t and k G S a in the end of time t). One can also show 
that for any Xkj once k G St (Xk j) in the end of time to 
for some to, we will have k G St(Xkj) for all t > to- By the 
above reasonings, checking whether k G S(Xkj) indeed tells 
us whether the intended receiver dk has received Xkj. 



Tracking the non-interference from the perspective of 
di 7^ dk'. Lemma [2] also ensures that v(Xk.j) is non- 
interfering from di's perspective for any i G S(Xkj), i ^ k. 
Therefore S(Xkj) successfully tracks whether v(Xkj) is 
non-interfering from the perspectives of di, i ^ k. 

Serving multiple destinations simultaneously by mixing 
non-interfering packets: The above discussion ensures that 
when we would like to send an information packet Xkj k to 
dk, we can send a coded packet \r(Xkj k ) as an information- 
lossless substitute. On the other hand, by Lemma such 
v(Xk.j k ) is non-interfering from di's perspective for all 
i G (S(Xkj k ) U {k}). Therefore, instead of sending a single 
packet v(Xkj k ), it is beneficial to combine the transmission 
of two packets v(Xk.j k ) and v(X; J; ) together, as long as 
I G S(Xkj k ) and k G S(Xij l ). More explicitly, suppose we 
simply add the two packets together and transmit a packet 
corresponding to [\r(Xkj k )+v(Xij t )]. Since v(Xkj k ) is non- 
interfering from di's perspective, it is as if di directly receives 
v(Xij l ) without any interference. Similarly, since ^(Xij^ is 
non-interfering from d^'s perspective, it is as if dk directly 
receives v(Xk,j k ) without any interference. By generalizing 
this idea, a PE scheme first selects a T G [K] and then 
choose all Xkj k such that k G T and v(Xk.j k ) are non- 
interfering from di's, perspective for all I G T\k (see Line [6] 
of the PE scheme). This thus ensures that the coded packet v tx 
in Line [7] of the PE scheme can serve all destinations k G T 
simultaneously. 

Creating new coding opportunities while exploiting the 
existing coding opportunities: As discussed in the example 
of Section III-DI the suboptimality of the existing 2-phase 
approach for K > 3 destinations is due to the fact that it 
fails to create new coding opportunities while exploiting old 
coding opportunities. The PE scheme was designed to solve 
this problem. More explicitly, for each Xkj the v(Xk.j) is 
non-interfering for all di satisfying i G (S(Xkj) U {k}). 
Therefore, the larger the set S(Xkj) is, the larger the number 
of sessions that can be coded together with v(Xkj). To 
create more coding opportunities, we thus need to be able 
to enlarge the S(Xkj) set over time. Let us assume that the 
Packet Selection in Line |6] chooses the X k . 3 such that 
S(Xkj) = T\k. That is, we choose the Xk.j that can be mixed 
with those (s, d{) sessions with I G S(Xkj) U {fc} = T. Then 
Line|4]of the UPDATE guarantees that if some other di, i £ T, 
overhears the coded transmission, we can update S(Xkj) with 
a strictly larger set S(Xkj) U S rx . Therefore, new coding 
opportunity is created since we can now mix more sessions 
together with Xkj. Note that the coding vector v(Xkj) is 
also updated accordingly. The new v(Xkj) represents the 
necessary "code alignment" in order to utilize this newly 
created coding opportunity. The (near-) optimality of the PE 
scheme is rooted deeply in the concept of code alignment, 
which aligns the "non-interfering subspaces" through the joint 
use of S(Xkj) and v(Xk.j). 



V. Quantify The Achievable Rates of PE Schemes 



definition, we must have 



In this section, we describe how to use the PE schemes 
to attain the capacity of l-to-3 broadcast PECs with COF 
(Proposition [2]), the achievability results for general 1-to- 
K broadcast PEC with COF (Proposition [3]), the capacity 
results for symmetric broadcast PECs (Proposition 2]i and for 
spatially independent PECs with one-sided fairness constraints 
(Proposition O. 

We first describe a detailed construction of a capacity- 
achieving PE scheme for general l-to-3 broadcast PECs with 
COF in Section IV- Al and then discuss the corresponding high- 
level intuition in Section IV-BI The high-level discussion will 
later be used to prove the achievability results for general 1-to- 
K broadcast PEC with COF in Section rV-Cl The proofs of the 
capacity results of two special classes of PECs are provided 
in Section |V-Dl 



A. Achieving the Capacity of l-to-3 Broadcast PECs With 
COF — Detailed Construction 

Consider a l-to-3 broadcast PEC with arbitrary channel 
parameters {p s ji 2 3}\s l- Without loss of generality, assume 
that the marginal success probability pk > for k = 1,2,3. 
For the cases in which pk — for some k, such dk cannot 
receive any packet. The l-to-3 broadcast PEC thus collapses 
to a l-to-2 broadcast PEC, the capacity of which was proven 
in E). 

Given any arbitrary rate vector (R\, i?2, R3) that is in 
the interior of the capacity outer bound of Proposition Q] 
our goal is to design a PE scheme for which each dk can 
successfully decode its desired packets {X^.j : Vj G [nR^]}, 
for k £ {1, 2, 3}, after n usages of the broadcast PEC. Before 
describing such a PE scheme, we introduce a new definition 
and the corresponding lemma. 

Given a rate vector (i?i, i?2, -R3) and the PEC channel 
parameters {p s ^ 2 3}\s i' we sa y mat destination dj dominates 



another dk, i =/= k if 



Ri 



1 



1 



Pu({l,2,3}\fc) Rj{l,2,3} 
>Rk ' 



1 



Pu({l,2,3}\i) Rj{l,2,3} 



(14) 



Lemma 4: For distinct values of i, k, I £ {1,2,3}, if di 
dominates dk, and dk dominates di, then we must have di 
dominates d[. 

Proof: Suppose this lemma is not true and we have di 
dominates dk, dk dominates di, and di dominates di. By 



Ri 



Rk 



Ri 



1 



1 



Pu({l,2,3}\fe) Pu{l,2,3} 
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(16) 



(17) 



We then notice that the product of the left-hand sides of dT5| ), 
( fT6b . and (fTTI i equals the product of the right-hand side of (l5l , 
( fTB . and dTTb . As a result, all three inequalities of (fHT l. ( fT6b . 
and ( fTTI i must also be equalities. Since ( fTTI i is an equality, we 
can also say that di dominates di . The proof of Lemma @] is 
complete. ■ 

By Lemma @] we can assume that d\ dominates cfo, ^2 
dominates d%, and d\ dominates d$, which can be achieved 
by relabeling the destinations dk- We then describe a detailed 
capacity-achieving PE scheme, which has four major phases. 
The dominance relationship is a critical part in the pro- 
posed PE scheme. The high-level discussion of this capacity- 
achieving PE scheme will be provided in Section IV-BI 

Phase 1 contains 3 sub-phases. In Phase 1.1, we always 
choose T = {1} for the PE scheme. In the beginning of time 1, 
we first select Xi t i. We keep transmitting the uncoded packet 
according to v(Xi.i) = 61.1 until it is received by at least 
one of the three destinations {d\, cfe, <fe}. Update its S(X\ t i) 
and v(Xi.i) according to the UPDATE rule. Then we move to 
packet Xi t 2- Keep transmitting the uncoded packet according 
to v(Xi 2) = <5i.2 until it is received by at least one of the 
three receivers {^1,^2,^3}. Update its S(Xi } 2) and v(Xl ; 2) 
according to the UPDATE rule. Repeat this process until all 
Xij, j £ [nRi] is received by at least one receiver. By the 
law of large numbers, Phase 1 . 1 will continue for 



nR\ 

Pu{l,2,3} 



time slots. 



(18) 



Phase 1.2: After Phase 1.1 we move to Phase 1.2. In 
Phase 1.2, we always choose T = {2} for the PE scheme. 
In the beginning of Phase 1.2, we first select X2.1. We keep 
transmitting the uncoded packet according to v(X2.i) = £2,1 
until it is received by at least one of the three destinations 
{dijcfejcfe}- Update its S(X2,i) and v(X2,i). Repeat this 
process until all X2J, j £ [71.R2] is received by at least one 
receiver. By the law of large numbers, Phase 1.2 will continue 
for 



ni?2 

Pu{l,2,3} 



time slots. 



(19) 



Phase 1.3: After Phase 1.2 we move to Phase 1.3. In 
Phase 1.3, we always choose T = {3} for the PE scheme. 
We repeat the same process as in Phases 1.1 and 1.2 until all 
X 3 j, j £ [nR 3 ] is received by at least one receiver. By the 
law of large numbers, Phase 1.3 will continue for 



nR 3 



time slots. 



Rj{l,2,3} 



(20) 



Phase 2: After Phase 1.3, we move to Phase 2. Phase 2 
contains 3 sub-phases. In Phase 2.1, we always choose 
T = {2, 3} for the PE scheme. Consider all the packets X2J 
that have S(X2j) = {3} in the end of Phase 1.3, which was 
resulted/created in Phase 1.2 when a Phase- 1.2 packet was 
received by d 3 only. Totally there are « — ^TiTT^ sucrl 
packets, which are termed the queue Q 2 . 3 - packets. Consider 
all the packets X 3 j that have S(X 3 j) = {2} in the end of 
Phase 1.3, which was resulted/created in Phase 1.3 when a 
Phase- 1.3 packet was received by d 2 only. Totally there are 

— p 3} suc h packets, which are termed the queue Q 3 . 2 j 
packets. 

We order all the Q 2 - 3 j packets in any arbitrary sequence 
and order all the Q 3 . 2 j packets in any arbitrary sequence. In 
the beginning of Phase 2.1, we first select the head-of-the-line 
X2 J2 and the head-of-line X 3 j 3 from these two queues Q 2 . 3 y 
and Q 3 . 2 y, respectively. Since 

S(X 2 , J2 ) U {2} = T = {2,3} = S(X 3>j3 ) U {3}, 

these two packets can be linearly combined together. Let Vtx 
denote the overall coding vector generated from these two 
packets (see Line [7j of the main PE scheme). As discussed 
in Line [To] of the main PE scheme, we keep transmitting the 
same coded packet v tx until at least one of the two packets 
X2j 2 and X 3 j 3 has a new S(X 2j2 ) (or a new S(X 3 j 3 )). In 
the end, we thus have three subcases: (i) only X2j 2 has a new 
S(X2.j 2 ), (ii) only X 3 j 3 has a new S(X 3 j 3 ), and (iii) both 
X 2j2 has a new S(Xzj 2 ) and X 3 ,j 3 has a new S(X 3 j 3 ). In 
Case (i), we keep the same T = {2, 3} and the same X 3 j 3 
but switch to the next-in-line Q 2 . 3 y packet X 2 j 2 - The new 
X% f will be then be used, together with the existing X 3 J3 to 
generate new v tx in Line|7]of the main PE scheme for the next 
time slot(s). In Case (ii), we keep the same T = {2, 3} and 
the same X 2 .j 2 but switch to the next-in-line Q 3 . 2 j packet 
X 3 j' . The new X 3 j> 3 will then be used, together with the 
existing X2j 2 , to generate new v tx in Line [7J of the main PE 
scheme for the next time slot(s). In Case (iii), we keep the 
same T = {2, 3} and switch to the next-in-line packets X 2 j> 
and X 3 y . The new pair X 2 j> and X 3 y 3 will then be used 
to generate new v tx in Line [7] of the main PE scheme for the 
next time slot(s). We repeat the above process until we have 
used up all Q 3 . 2 j packets X 3 

Remark 1: One critical observation of the PE scheme is 
that when two packets X 2j2 or X 3 ^ 3 are mixed together to 
generate v tx , each packet still keeps its own identity X 2j2 
and X 3 ,j 3 , its own associated sets S(X2j 2 ) and S(X 3 j 3 ) 
and coding vectors v(X 2 j 2 ) and v(X 3 ,j 3 ). Even the decision 



whether to update S(X) or \r(X) is made separately (Line [2] 
of the Update) for each of the two packets X^j 2 or X 3 j 3 . 
Therefore, it is as if the two packets X2.j 2 or X 3 j 3 are sharing 
the single time slot in a non-interfering way (like carpooling 
together). Following this observation, in Phase 2.1, whether 
we decide to switch the current X2 j 2 to the next-in-line Q 2 . 3 y 
packet X 2 .j> 2 is also completely independent from the decision 
whether to switch the current X 3 j 3 to the next-in-line Q 3 . 2 j 
packet X 3 j^. 

Remark 2: We first take a closer look at when a Q 3 . 2 j packet 
X 3 j 3 will be switched to the next-in-line packet X 3 y . By 
Line S] of the UPDATE, we switch to the next-in-line X 3 ] ^ if 
and only if one of {di,d 3 } has received the current packet 
v tx , in which X 3 j 3 participates. Therefore, in average each 



X 3 j 3 will stay in Phase 2.1 for 



have 



Pu{l,3} 



time slots. Since we 



{2}{1,3} 



PU{1,2,3} 



number of Q 3 . 2 j packets to begin with, 



it takes 



Rj{l,2 : 3} RJ{1,3} 



= nR 3 
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Rj{l,3} RJ{1,2,3} 
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to completely finish the Q 3 . 2 j packets. By similar arguments, 
it takes 



lR 2P{3}TTl} 1_ 

Rj{l,2,3} Aj{1,2} 



nR 2 
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Rj{l,2} Rj{l,2,3} 
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to completely use up the Q 2 3 j packets. Since we assume that 
d 2 dominates d 3 , the dominance inequality in ( fl4t implies that 
(l22l is no smaller than (l2T1 i. Therefore we indeed can finish 
the Q 3 . 2 ~ packets before exhausting the Q 2 - 3 t packets. 

Remark 3: Overall it takes roughly $2l[ of time slots to 
finish Phase 2.1. 

Phase 2.2: After Phase 2.1, we move to Phase 2.2. In 
Phase 2.2, we always choose T = {1, 3} for the PE scheme. 
Consider all the packets Xij that have S(Xij) = {3} in 
the end of Phase 2.1, which was resulted/created in Phase 1.1 
when a Phase- LI packet was received by d 3 only. Totally 

such packets, which are termed 



nR\p 



{3}{1.2} 



there are 

PU{1,2.3} 

the queue Q 1 . 32 - packets. Consider all the packets X 3 ,j that 
have S(X 3 j) = {1} in the end of Phase 2.1, which was 
resulted/created in Phase 1.3 when a Phase-1.3 packet was 
received by d\ only. We note that there are some Q 3 . 2 j 
packets being transmitted in Phase 2.1. Before the trans- 
mission of Phase 2.1, those packets have S(X 3 j) = {2} 
and after the transmission of Phase 2.1, those packets will 
have their S(X 3 j) being one of the three forms {1,2}, 
{2,3}, and {1,2,3} (see Line g] of the Update). Therefore, 
Phase 2.1 does not contribute to any X 3 j packets considered 
in Phase 2.2 (those with S(X 3 j) = {1}). Totally there are 
« 1 n * packets considered in Phase 2.2, which are 

PU{1,2,3} ^ 

termed the queue Q 3 . 12 packets. 

We order all the Q 1 . 3 2 packets in any arbitrary sequence 
and order all the Q 312 packets in any arbitrary sequence. 
Following similar steps as in Phase 2.1, we first mix the 



head-of-the-line packets X\,j 1 and X 3 j 3 of Q x . 3 2 and Q312' 
respectively, and then make the decisions of switching to the 
next-in-line packets Xij and X 3 ji independently for the two 
queues Q 1 . 32 - and Q3 it We re P eat the above process until we 
have used up all Q 3 . 12 - packets X 3 j. 

Remark: We take a closer look at when a Q 3 . X 2 packet X 3 j 3 
will be switched to the next-in-line packet X^jt. By Line [4] 
of the Update, we switch to the next-in-line X 3 ^ if and 
only if one of {^2,(^3} has received the current packet v tx , 
in which X 3> j 3 participates. Therefore, in average each X 3 j 3 
will stay in Phase 2.2 for z—^ — time slots. Since we have 



3 U{2,3} 



— 3P{1>{2,3} number of ,« packets to begin with, it takes 
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= nR 3 
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(23) 



to completely finish the Q 31 2 packets. By similar arguments, 
it takes 



nR ip {3} {m 1 / 

— nrii 



Pu{i,2,3} Pu{l,2} 



Rj{l,2} Rj{l,2,3} 



(24) 



to completely use up the Q 1 . 32 - packets. Since we assume that 
d\ dominates d 3 , the dominance inequality in (fl4l implies that 
( l24l is no smaller than ( f23l . Therefore we indeed can finish 
the Q 3 . 12 - packets before exhausting the Q 1 . 32 - packets. Overall 
it takes roughly d23l number of time slots to finish Phase 2.2. 

Phase 2.3: After Phase 2.2, we move to Phase 2.3. In 
Phase 2.3, we always choose T = {1, 2} for the PE scheme. 
Consider all the packets Xij that have S(Xij) = {2} in 
the end of Phase 2.2, which was resulted/created in Phase 1.1 
when a Phase- 1.1 packet was received by d-2 only. Note that 
the transmission in Phase 2.2 does not create any new such 

nR iP{2}JTJ} 



packets. Totally there are thus 



such packets, 



Pull, 2. 3} 

which are termed the queue Q 1 . 23 packets. Consider all the 
packets X2J that have S(X% t j) = {1} in the end of Phase 2.2, 
which was resulted/created in Phase 1.2 when a Phase- 1.2 
packet was received by d\ only. Note that the transmission in 
Phase 2.1 does not create any new such packets. Totally there 

are thus « ^fEilMlM sucn packets, which are termed the 

Pun, 2, 3} r 
queue Q 2 -i3 packets. 

We order all the Q 1 . 93 packets in any arbitrary sequence 

and order all the Q 2 -i3 packets in any arbitrary sequence. 

Following similar steps as in Phases 2.1 and 2.2, we first 

mix the head-of-the-line packets X\j 1 and X2j 2 of Q 1 . 23 and 

Q 2 -i3> respectively, and then make the decisions of switching 

to the next-in-line packets Xijr and X 2 j> 2 independently 

for the two queues Q 1 . 23 and Q 2 13- ^ e re P ea t the above 

process until we have used up all Q 2 13 packets X2j- By the 

assumption that d\ dominates c?2 and by the same arguments as 

in Phases 2.1 and 2.2, we indeed can finish the Q 2 13 packets 



before exhausting the <5 12 3 packets. Overall it takes roughly 
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of time slots to finish Phase 2.3. 

Phase 3: Before the description of Phase-3 operations, we 
first summarize the status of all the packets in the end of 
Phase 2.3. For 1^3, all X 3 j packets that have S(X 3 j) = 
have been used up in Phase 1.3. All X 3 .j packets that have 
S(X 3 j) = {1} have been used up in Phase 2.2. All X 3 j 
packets that have S(X 3 .j) = {2} have been used up in 
Phase 2.1. As a result, all the X 3 t j packets are either received 
by d 3 (i.e., having 3 G S(X 3tj j) or have S(X 3>j ) = {1,2}. 
For Phase 3, we will focus on the latter type of X 3t j packets, 
which are termed the Q3 ; i2 packets. Recall the definition of 
fp(ST) in ©. Totally, we have 



nR 3 



( P123 + P123 /p( 23 ) + P2T3 /p( 13 ) 

VPU{1,2,3} Pu{l,2,3} Pu{2,3} Pu{l,2,3} Pu{l,3} 



(26) 



number of Q3 ; i2 packets in the beginning of Phase 3, where 
the first, second, and the third terms correspond to the <53 ; i2 
packets generated in Phase 1.3, Phase 2.2, and Phase 2.1, 
respectively. We can further simplify d26T > as 



26]) = nR 3 p 3 
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For d2, all X 2 ,j packets that have S(X2j) = have 
been used up in Phase 1.2. All packets that have 

S(X2j) = {1} have been used up in Phase 2.3. As a result, all 
the X2.3 packets must satisfy one of the following: (i) X2J 
are received by d 2 (i.e., having 2 G S(X 2 ,j)), or (ii) have 
S(X 2>j ) = {3}, or (iii) have S(X 2 , 3 ) = {1,3}. For Phase 3, 
we will focus on the latter two types of X2.] packets, which 
are termed the Q 9 . 3 y and the Q2;i3 packets, respectively. There 
are 

P2T3 PU{1,2} 



nR 2 



P312 



- nR 3 



(28) 



PU{1,2,3} PU{1,2,3} PU{1,3} 

number of Q 2 . 3 j packets in the beginning of Phase 3, where 
the first term is the number of Q 2 . 3 j packets generated in 
Phase 1.2 and the second term corresponds to the number 
of Q 2 . 3 j packets that are used up in Phase 2.1. d28l i can be 
simplified to 



23 = 
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There are 

Pl32 
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number of Q2 ; i3 packets in the beginning of Phase 3, where 
the first, second, and third terms correspond to the number of 
Q2 ; i3 packets generated in Phase 1.2, Phase 2.3, and Phase 
2.1, respectively. 

For di, all X\j packets that have S(Xij) = have been 
used up in Phase 1.1. As a result, all the X\j packets must 
satisfy one of the following: (i) X\,j are received by d\ (i.e., 
having 1 6 S{X 1 , j )), or (ii) have S(X ltj ) = {2}, (iii) have 
S(X hj ) = {3}, or (iv) have S(X hj ) = {2,3}. For Phase 3, 
we will focus on the types (ii) and (iii), which are termed the 
and the Q x . 32 packets, respectively. There are 

Pl23 PU{1,3} 
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Pu{l,2,3} Rj{l,2,3} Aj{2,3} 

number of Q V23 packets in the beginning of Phase 3, where 
the first term is the number of Q V23 packets generated in 
Phase 1.1 and the second term corresponds to the number 
of Q 1 . 23 packets that are used up in Phase 2.3. (|3T1 i can be 
simplified to 



IB =Rj{i,3} nRi 
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There are 
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nR 3 P ^ Pu{1 - 2} (33) 



PU{1,2,3} Aj{l,2,3} Pl_l{2,3} 

number of Q V32 packets in the beginning of Phase 3, where 
the first term is the number of Q 1 . 32 packets generated in 
Phase 1.1 and the second term corresponds to the number 
of Q x . 3 2 packets that are used up in Phase 2.2. (l33l can be 
simplified to 



33J) =Pu{i,2} nRx 
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,Pu{2,3) Pu{l,2,3}, 

We are now ready to describe Phase 3, which contains 3 
sub-phases. 

Phase 3.1: Similar to Phase 2.1, we choose T = {2,3} 
for the PE scheme. In Phase 2.1, we chose the Q 2 . 3 j packets 
X 2 j 2 and the Q 3 . 2 j packets X 3 j 3 satisfying S(X 2 j 2 ) = {3} 
and S(X 3 j 3 ) = {2}. Since we have already used up all Q 3 . 2 j 
packets in Phase 2.1, in Phase 3.1, we choose the Q 2 -3i packets 
X 2 j 2 and the new Q 3l \ 2 packets X 3 J3 instead, such that 
the packets satisfy S(X 2j2 ) = {3} and S(X 3)j3 ) = {1,2}. 
Similar to Phase 2.1, we switch to the next-in-line packet 
as long as the S(X 2 j 2 ) (or S(X 3 j 3 )) is changed. Again, 
the decision whether to switch from X 2 j 2 to the next-in- 
line packet X 2 j> 2 is independent from the decision whether 
to switch from X 3 j 3 to the next-in-line packet X 3 y . 

Note that, by Line|4]of the UPDATE, the S(X 2 j 2 ) of a Q 2 . 3 j 
packet X 2 j 2 will change if and only if it is received by any 
one of {d%, d 2 }. Therefore, in average each Q 2 . 3 j packet X 2 j 2 
will take — - — number of time slots before we switch to the 

Pu{l,2} 



next-in-line packet X 2 y. For comparison, the S(X 3 j 3 ) of a 
Q3 ; i2 packet X 3 ,j 3 will change if and only if it is received by 
{d 3 }. Therefore, in average each Q 3: i 2 packet X 3 j 3 will take 
— number of time slots before we switch to the next-in-line 

P3 

packet X 3 ji, We continue Phase 3.1 until we have finished all 
Q 2 . 3 j packets. It is possible that we finish the Q 3: i 2 packets 
before finishing the Q 2 3T packets. In this case, we do not 
need to transmitting any <53;i2 packets anymore and we use a 
degenerate T = {2} instead and continue Phase 3.1 by only 
choosing Q 2 . 3 y packets X 2 j 2 . Intuitively, Phase 3.1 is a clean- 
up phase that finishes the Q 2 . 3 j packets that have not been 
used in Phase 2.1. While finishing up Q 2 . 3 j packets, we also 
piggyback some Q 3: i2 packets through network coding. If all 
Q 3 -i 2 packets have been used up, then we continue sending 
pure Q 2 . 3 j packets without mixing together any Q 3: i2 packets. 

Since we have (l29l i number of Q 2 . 3 j packets to begin with, 
it will take 
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,Rj{l,3} Rj{l,2,3}, 

number of time slots to finish Phase 3.1. 

Remark: When we transmit a Q 2 . 3 j packet X 2 j 2 , the new 
S(X 2 j 2 ) becomes {1,3} if and only if S a = {1} (i.e., only 
d\ receives X 2 j 2 ). Therefore Phase 3.1 will also create some 
new Q 2; i3 packets. After Phase 3.1, the number of Q 2; i3 
packets is changed from (f30b to 



nR 2 



P132 , P123 /p(32) | P3T2 / P (12) 
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where the first, second, and the third terms correspond to the 
Q2 ; i3 packets generated in Phase 1.3, Phase 2.3, and Phase 2.1 
plus Phase 3.1, respectively. We can further simplify d36t as 
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Phase 3.2: After Phase 3.1, we move to Phase 3.2. Similar 
to Phase 3.1, Phase 3.2 serves the role of cleaning up the Qi. 32 
packets that have not been used in Phase 2.2. More explicitly, 
we choose T = {1,3}, and use the Q 1 . 32 packets Xij ± and 
the new Q3 ; i2 packets X 3 ,j 3 , such that the packets satisfy 
S{Xi tjl ) = {3} and S(X 3J . 3 ) = {1,2}. It is possible that all 
Q3 ; i2 packets have been used up in Phase 3.1. In this case, we 
do not need to transmitting any Q 3: i 2 packets anymore and 
we use a degenerate T = {1} instead and continue Phase 3.1 
by only choosing Q 1 . 3 ? packets X\ j x . 

Similar to all previous phases, we switch to the next-in- 
line packet as long as the S(X± t j 1 ) (or S(X 3 j 3 )) is changed, 
and the decision whether to switch from X± t j 1 to the next-in- 
line packet Xij> is independent from the decision whether 
to switch from X 3 j 3 to the next-in-line packet X 3 j> . We 
continue Phase 3.2 until we have finished all Q 1 . 3 ? packets. 
Again, if we finish the Q3 : i2 packets before finishing the Q 1 . 32 



packets, then we stop transmitting any Qz t i2 packets, use a 
degenerate T = {1} instead, and continue Phase 3.2 by only 
choosing Qi-$z packets X\j ± . 

By Line S] of the UPDATE, the S(Xi_ J1 ) of a Q V32 packet 
Xiji will change if and only if it is received by any one 
of {di,d 2 }. Therefore, in average each Q 1 . 32 packet Xij 1 
will take — - — number of time slots before we switch to the 

Pu{l,2} 

next-in-line packet Xiji . Since we have ( 134b number of Q 1 . 32 
packets to begin with, it will take 
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number of time slots to finish Phase 3.2. 

Phase 3.3: After Phase 3.2, we move to Phase 3.3. Similar 
to Phases 3.1 and 3.2, Phase 3.3 serves the role of cleaning up 
the Q x . 2 3 packets that have not been used in Phase 2.3. More 
explicitly, we choose T = {1,2}, and use the Q 1 . 23 packets 
Xij 1 and the new Q2;i3 packets X 2t j 2 , such that the packets 
satisfy S(X lth ) = {2} and S(X 2 J 2 ) = {1,3}. Recall that 
in the beginning of Phase 3.3, we have d37l i number of Q2;i3 
packets. 

Similar to all previous phases, we switch to the next-in- 
line packet as long as the S(Xi j 1 ) (or S(X 2 j 2 )) is changed, 
and the decision whether to switch from Xxj 1 to the next-in- 
line packet Xi j^ is independent from the decision whether 
to switch from X 2 j 2 to the next-in-line packet X 2 j>. We 
continue Phase 3.3 until we have finished all Q 1 . 23 packets. If 
we finish the Q 2 ,i3 packets before finishing the Qi-23 packets, 
then we stop transmitting any Q 2 -i3 packets, use a degenerate 
T = {1} instead, and continue Phase 3.3 by only choosing 
QiiS P ac kets Xij 1 . 

By Line S] of the UPDATE, the S(X ltjl ) of a Q 1 . 23 packet 
Xij 1 will change if and only if it is received by any one 
of {di,ds}. Therefore, in average each Q 1 . 23 packet Xi j 1 



will take 



number of time slots before we switch to the 



PU{1,3} 

next-in-line packet Xij' . Since we have i32\ number of Q 1 . 23 
packets to begin with, it will take 
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1 



1 



nR 2 
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(39) 



number of time slots to finish Phase 3.3. 

Phase 4: We first summarize the status of all the packets 
in the end of Phase 3.3. For d 3 , all the X 3 j packets are 
either received by d 3 (i.e., having 3 G S(Xs ! j)) or have 
S(X 3 j) = {1,2}, the Q3;i2 packets. By Line 0] of the 
Update, the S(X 3j3 ) of a Q3-i 2 packet X 3 , J3 will change if 
and only if it is received by d 3 . Therefore, in average each 
£?3;i2 packet Xaj 3 will take ^ number of time slots before we 
switch to the next-in-line packet X^f . Since the Q3 ; i2 packets 
participate in Phases 3.1 and 3.2, in the end of Phase 3.3, the 



total number of Q3 ; i2 packets becomes 

(Eq.(|27l)-P3-Eq.(l35l)-P3-Eq.(|38l)) + , (40) 

where (•)+ = max(-,0) is the projection to the non-negative 
reals. 

For d 2 , all X 2 j packets that have S(X 2 j) = or 
S(X 2 ,j) = {1} have been used up in Phase 1.2 or Phase 2.3, 
respectively. All X 2 j packets that have S(X 2 j) = {3} have 
been used up in Phases 2.1 and 3.1. As a result, all the X 2 j 
packets are either received by d 2 (i.e., having 2 G S(X 2 j)) 
or have S(X 2 j) = {1, 3}, the Q 2 -i3 packets. By Line|4]of the 
Update, the S(X 2 j 2 ) of a Q 2 -i3 packet X 2 , n will change if 
and only if it is received by d 2 . Therefore, in average each 
Q2;i3 packet X 2 j 2 will take ^ number of time slots before 
we switch to the next-in-line packet X 2 j/ . Since the <52;i3 
packets also participate in Phase 3.3, in the end of Phase 3.3, 
the total number of Q 2t i3 packets becomes 



(Eq.d37j -pa-Eq.@5))- 



(41) 



For di, all X XJ packets that have S(X 1 j) = 0, S(X 1 j) = 
{2}, and S(Xij) = {3} have been used up in Phases 1.1, 
2.3+3.3, and 2.2+3.2, respectively. As a result, all the X\j 
packets are either received by d\ (i.e., having 1 G S(Xij)) 
or have S(X± t j) = {2,3}, the Qi- 2 3 packets. In the end of 
Phase 3.3, the total number of Qi- 2 3 packets is 



nRi 



( P23T 



P2T3 /p( 31 ) , P3T2 fp( 21 ) 



\PU{1,2,3} PU{1,2,3} PU{1,3} PU{1,2,3} Pu{l,2} 



(42) 



where the first, second, and the third terms correspond to the 
Q 1;2 3 packets generated in Phase 1.1, 2.3+3.3, and 2.2+3.2, 
respectively. We can further simplify d42l as 



ED = nR lPl [ — - 



1 



1 



Pi Pu{l,2} PU{1,3} Pu{l,2,3} 



(43) 



In Phase 4, since the only remaining packets (that still need 
to be retransmitted, see Lemma |3j are the Qi, 2 3, Q2,i3, and 
<33;i2 packets, we always choose T = {1, 2, 3} and randomly 
and linearly mix the Qi- 2 3, Q2;i3, and Q3-i 2 packets (one 
from each queue) for each time slot. That is, we use Phase 4 to 
clean up the remaining packets. Since in average a Qj ; {i,2,3}\i 
packet Xij takes ~ amount of time before it is received by 
di, Phase 4 thus takes 

'Eq.dH Eq.gB Eq.(gO| N 



max — - , — , — . (44) 

V Pi P2 P3 J 

number of time slots to finish. More precisely, as time pro- 
ceeds, we need to gradually switch to a degenerate T. For 
example, if the Q2 ; i3 packets are used up first, then we set 
the new T = {1,3} and focus on mixing the remaining <5i ; 23 
and Q3 ; i2 packets. After d44b number of time slots, it is thus 
guaranteed that for sufficiently large n, all information packets 
X k ,j, k G {1,2,3}, and j G [nR k ] satisfy k G S(X kij ). 
By Corollary [2] all d k can decode the desired packets X k j, 
j G [nRk] with close-to-one probability. 



Quantify the throughput of the 4-phase scheme: The 

remaining task is to show that if (R\, R 2 , R3) is in the interior 
of the outer bound in Proposition [TJ then the total number of 
time slots used by the above 4-Phase PE scheme is within the 
time budget n time slots. That is, we need to prove that 



(HU) + U9J) + d20]) + <J2TD + d23j) 

+ © + @U + {SD + ||35) + ©<n. (45) 

The summation of the first nine terms of the left-hand side of 
can be simplified to 

111 



,Pu{l,2} Pu{l,3} Pu{l,2,3} 

+ nR 2 + nR 3 , 

Pu{l,2} Pu{l,2,3} 

where Ai.i_3.3 is the total number of time slots in Phases 1.1 to 
3.3. Since (44) is the maximum of three terms, proving (45) is 
thus equivalent to proving that the following three inequality 
hold simultaneously. 



-4.1.1-3.3 + 



Pi 



< n, 



-A1.1-3.3 H < n, 



and A) 



P2 



P:\ 



< n. 



With direct simplification of the expressions, proving the 
above three inequalities is equivalent to proving 



< n, 



and 



nRi 


nR 2 


nR 3 


+ 

Pi 


Rj{l,2} 


Rj{l,2,3} 


nRi 


H + 


ni? 3 


PU{1,2} 


P2 


Pu{l,2,3) 


nRi 


nR 2 


nR 3 



Rj{l,3} Rj{l,2,3} 



P'i 



< n, 



< n, 



which hold for any (Ri,R 2 , R3) in the interior of the capacity 
outer bound in Proposition Q] More rigorously, by the law of 
large numbers, the expressions of the numbers of time slots 
in Phase 1.1 to Phase 4: (US), (ED, (ED), <ED, (23), (25), 
(35) , (38), (39), and (44) , are all of precision o{n). Since 
(Ri,R 2 , R3) is in the interior of the capacity outer bound in 
Proposition Q] the last three inequalities hold with arbitrarily 
close to one probability for sufficiently large n. The proof of 
Proposition E] is thus complete. 

B. Achieving the Capacity of l-to-3 Broadcast PECs With 
COF — High-Level Discussion 

As discussed in Section [V-AI one advantage of a PE scheme 
is that although different packets Xk,j k and Xi j. with k ^ i 
may be mixed together, the corresponding evolution of Xkj k 
(the changes of S(Xkj k ) and v(Xkj k )) are independent from 
the evolution of Xj^ . Also by LemmaEl two different packets 
Xkj k and Xi t j ( can share the same time slot without inter- 
fering each other as long as i G S(Xk.j k ) and k e S(Xij i ). 
These two observations enable us to convert the achievability 
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Fig. 4. The state transition diagram for destination d\ when applying the 
packet evolution scheme to a l-to-3 broadcast PEC. 



problem of a PE scheme to the following "time slot packing 
problem." 

Let us focus on the (s, dx) session. For any X\j packet, 
initially S(Xij) = 0. Then as time proceeds, each Xij 
starts to participate in packet transmission. The corresponding 
S(Xij) evolves to different values, depending on the set 
of destinations that receive the transmitted packet in which 
Xij participates. Since in this subsection we focus mostly 
on S(Xij), we sometimes use S(X) as shorthand if it is 
unambiguous from the context. Fig. [4] describes how S(X) 
evolves between different values. In Fig. |H we use circles to 
represent the five different states according to the S(X) value. 
Recall that S rx is the set of destinations who successfully 
receive the transmitted coded packet. The receiving set S IX 
decides the transition between different states. In Fig. [4] we 
thus mark each transition arrow (between different states) by 
the value(s) of S lx that enables the transition. For example, by 
Line H of the UPDATE, when the initial state is S(X) = 0, if 
the receiving set S lx 3 1, then the new set satisfies S(X) 3 1. 
Similarly, when the initial state is S(X) = 0, if S a = {2, 3}, 
then the new S(X) becomes S(X) = {2, 3}. (Note that the 
corresponding v(Xij) also evolves over time to maintain the 
non-interfering property in Lemma El which is not illustrated 
in Fig. |4] ) 

Since S(Xij) 3 1 if and only if d\ receives Xij, it thus 
takes logical time slots to finish the transmission of nR\ 
information packets. On the other hand, some logical time 
slots for the (s, d±) session can be "packed/shared" jointly 
with the logical time slots for the (s,effc) session, k ^ 1, or, 
equivalently, one physical time slot can serve two sessions 
simultaneously. For the following, we quantify how many 
logical time slots of the (s, d\) session are compatible to those 
of other sessions. For any So € 2^ 1,2 ' 3 ^, let Ax ; s denote the 
number of logical time slots (out of the total time slots) 
such that during those time slots, the transmitted X±j has 
S(Xij) = So. Initially, there are nR\ packets X\j. If any one 
of {^1,^2,^3} receives the transmitted packet (equivalently 
5rx 7^ 0), S(Xij) becomes non-empty. Therefore, each Xij 



contributes to 
We thus have 



Pu{l,2,3} 



logical time slots with S(X 



Id I 



Ai ; = nRi 



1 



Pu{l,2,3} 



(46) 



We also note that during the evolution process of Xij, if any 
one of {di,d 3 } receives the transmitted packet (equivalently 
St% H {1, 3} 7^ 0), then S(X) value will move from one of the 
two states "S(X) = 0" and "S(X) = {2}" to one of the three 
states "S(X) = {3}," "S(X) = {2, 3}," and "S(X) 3 1." 
Therefore, each Xi j contributes to 
for which we either have S(Xij) = 
the above reasoning, we have 



— - — logical time slots 

'U{1,3} ° 

or S(X ltj ) = {2}. By 



Ai;{2} + M-% = nRi 



1 



Pu{l,3} 



(47) 



Similarly, during the evolution process of Xij, if any one of 
{di,d 2 } receives the transmitted packet (equivalently 5 1X n 
{1, 2} 7^ 0), then S(X) value will move from one of the two 
states "S(X) = 0" and "S(X) = {3}" to one of the three 
states "S(X) = {2}," "S(X) = {2, 3}," and "S(X) 3 1." 
Therefore, each X-i , contributes to — - — logical time slots 

i,J Pu{1.2} & 

for which either S(Xi tj ) = or S(Xi tj ) = {3}. By the above 
reasoning, we have 



A l;{3} + A l;0 = nR l 



1 



PU{1,2} 



(48) 



Before S(X) evolves to the state "S(X) 3 1," any logical time 
slot contributed by such an X must have one of the following 
four states: "S(X) = 0," "S(X) = {2}," "S(X) = {3}," and 
"S(X) = {2,3}." As a result, we must have 



( ^ ) 



Solving (O, (|47l , (|48] i, and (g9]l, we have 
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(50) 
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(52) 
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(53) 



We can also define A^s as the number of logical time slots of 
the (s,dk) session with S(Xk,j k ) = So. By similar derivation 



arguments, we have 
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(61) 

Recall that by definition, Af. ; s i s me number of logical time 
slots of the (s, dk) session that is compatible to the logical time 
slots of (s,di) session with i G So- The achievability problem 
of a PE scheme thus becomes the following time slot packing 
problem. 

Consider 12 types of logical time slots and each 
type is denoted by (k; So) for some k G {1,2,3}, 
S Q G 2 {1 ' 2 ' 3 \ and k f S . The numbers of logical 
time slots of each type are described in ( TSUI ) to 
doll . Two logical time slots of types (ki\Si) and 
(k 2 ; S 2 ) are compatible if k\ 7^ k 2 , ki G 5*2, and 
k 2 G Si. Any compatible logical time slots can be 
packed together in the same physical time slot. For 
example, consider the following types of logical time 
slots: (1;{2,3», (2; {1,3}), and (3; {1,2}). Three 
logical time slots, one from each type, can occupy 
the same physical time slot since any two of them 
are compatible to each other. The time slot packing 
problem is thus: Can we pack all the logical time 
slots within n physical time slots? 
The detailed 4-phase PE scheme in Section IV-AI thus 
corresponds to the time-slot-packing policy depicted in Fig. [5] 
Namely, we first use Phases 1.1 to 1.3 send all the logical 
time slots that cannot be packed with any other logical time 
slots. Totally, it takes Ai.§ + A 2; + A 3 .0 number of time 
slots to finish Phases 1.1 to 1.3. We then use Phases 2.1 to 
2.3 to pack those logical time slots that can be packed with 
exactly one other logical time slot from a different session. 
By the assumption that di dominates d 2 and d 3 , and d 2 
dominates d 3 , we have ^4i ; {2} > ^2;{i}^ ^i:{3} > ^3:{i}' an d 
^2;{3} > ^3;{2}- Therefore, it takes A 3 . {2} + A 3 . { i } + A 2 . { i } 
number of physical time slots to finish Phases 2.1 to 2.3. 
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Number of 
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max(^4 1: { 2 .3}, ^2:{i,3} - Phase 3.3, 
^3;{i,2} - (Phases 3.1 + 3.2)) 



Fig. 5. The time-slot packing policy that corresponds to the 4-Phase solution for l-to-3 broadcast PECs. The shaded rectangles represent the logical time 
slots of types (1; {2, 3}), (2; {1, 3}), and (3; {1, 2}). 



Phases 3.1 to 3.3 are to clean up the remaining logical 
time slots of types (2; {3}), (1; {3}), and (1; {2}). We notice 
that in Phase 3.1 when sending a logical time slot of type 
(2; {3}), there is no type-(3, {2}) logical time slot that can be 
mixed together. On the other hand, there are still some type- 
(3, {1,2}) logical time slots, which can also be mixed with 
the logical time slots of the (s,c?2) session. Therefore, when 
we send a logical time slot of type (2; {3}), the optimal way 
is to pack it with a type-(3, {1, 2}) logical time slots together 
as illustrated in Phase 3.1 of Fig. [5] It is worth emphasizing 
that although those type- (3, {1, 2}) logical time slots can be 
packed with two other logical time slots simultaneously, there 
is no point to save the type-(3, {1, 2}) logical time slots for 
future mixing. The reason is that when Phase 3.1 cleans up the 
remaining type-(2; {3}) logical time slots, it actually provides 
a zero-cost free ride for any logical time slot that is compatible 
to a type- (2; {3}) logical time slot. Therefore, piggybacking a 
type-(2;{3}) logical time slot with a type-(3, {1, 2}) logical 
time slot is optimal. Similarly, we also take advantage of 
the free ride by packing logical time slots of type- (1; {3}) 
with that of type-(3; {1, 2}) in Phases 3.2, and by packing 
logical time slots of type-(l; {2}) with that of type-(2; {1, 3}) 
in Phases 3.3. It thus takes 

(^2;{3} - ^3;{2}) + (^1;{3> ~ ^3;{1}) + (^1;{2} ~ ^2;{1}) 

number of time slots to finish Phases 3.1 to 3.3. 

In Phase 4, we clean up and pack together all the remain- 
ing logical time slots of types (1;{2,3}), (2; {1,3}), and 
(3; {1,2}). We thus need 

max ((^4 3 ;{l,2} - (^2;{3} - ^3;{2}) - ( A l;{3] ~ ^3;{1})) + , 

U»;{1,3} - (A 1;{2} - A 2;{1} ))+, A u{2t3} ) (62) 

number of time slots to finish Phase 4. Depending on which 
of the three terms in (1621 is the largest, the total number of 
physical time slots is one of the following three expressions: 

^3;0 + ^3:{1} + ^3;{2} + ^3;{1,2} + M;$ + ^1;{2} + ^2;0, 
^2:0 + ^2;{1} + ^2;{3} + ^2:{1,3} + 4l.;0 + A;{3} + ^3:0: 
Or A 1; + A:{2} + 4l;{3} + ^1;{2,3} + ^2:0 + ^2;{3} + ^3;0- 



By (l50t to doTT l. one can easily check that all three equations 
are less than n for any (R±, R2, R3) in the interior of the 
outer bound of Proposition [T] which answers the time-slot- 
packing problem in an affirmative way. One can also show that 
the packing policy in Fig. [5] is the tightest among any other 
packing policy, which indeed corresponds to the capacity- 
achieving PE scheme described in Section IV-AI 

C. The Achievability Results of General 1-to-M Broadcast 
PECs With COF 

In Section IV-BI we show how to reduce the achievability 
problem of a PE scheme to a time-slot-packing problem. 
However, the converse may not hold due to the causality 
constraint of the PE scheme. By taking into account the 
causality constraint, the time-slot-packing arguments can be 
used to generate new achievable rate inner bounds for general 
1-to-A/ broadcast PECs with COF, which will be discussed in 
this subsection. 

One major difference between the tightest solution of the 
time-slot-packing problem in Fig. and the detailed PE 
scheme in Section IV-AI is that for the former, we can pack 
the time slots in any order. There is no need to first pack 
those logical time slots that cannot be shared with any other 
time slots. Any packing order will result in the same amount 
of physical time slots in the end. On the other hand, for the PE 
scheme it is critical to perform the 4 phases (10 sub-phases) 
in sequence since many packets used in the later phase are 
generated by the previous phases. For example, all the packets 
in Phases 2 to 4 are generated in Phases 1.1 to 1.3. Therefore 
it is imperative to conduct Phase 1 first before Phases 2 to 4. 
Similarly, the Qz-,{\,2} packets used in Phases 3.1 and 3.2 are 
generated in Phases 1.3, 2.1, and 2.2. Therefore, the number 
of Qs-Ji,2} packets in the end of Phase 1.3 (without those 
generated in Phases 2.1 and 2.2) may not be sufficient for 
mixing with Qi ; {3} packets. As a result, it can be suboptimal 
to perform Phase 3.1 before Phases 2.1 and 2.2. 

The causality constraints for a 1-to-M PEC with M > 4 
quickly become complicated due to the potential cyclic depen- 



denc^l of the problem. To simplify the derivation, we consider 
the following sequential acyclic construction of PE schemes, 
which allows tractable performance analysis but at the cost 
of potentially being throughput suboptimal. As will be seen in 
Section [VI-D1 for most PEC channel parameters, the proposed 
sequential acyclic PE schemes are sufficient to achieve the 
channel capacity. 

For the following, we describe the sequential PE schemes. 
The main feature of the sequential PE scheme is that we 
choose the mixing set T in a sequential, acyclic fashion. For 
comparison, the T parameters used in the capacity-achieving 
l-to-3 PE scheme of Section |V-Al are {1}, {2}, {3}, {2,3}, 
{1,3}, {1, 2}, {2, 3}, {1, 3}, {1, 2}, and {1, 2, 3} in Phases 1.1 
to 4, respectively. We notice that T = {1, 2} is visited twice 
in Phases 2.3 and 3.3. We thus call the capacity-achieving PE 
scheme a cyclic PE scheme. For the sequential PE schemes, 
we never revisit any T value during all the phases. 

To design a sequential PE scheme, we first observe that 
in the capacity-achieving 4-Phase PE scheme in Section IV-AI 
we always start from mixing a small subset T then gradually 
move to mixing a larger subset T. The intuition behind is that 
when mixing a small set, say T = {2, 3} in Phase 2.1, we can 
create more coding opportunities in the later Phase 4 when 
T = {1,2,3}. Recall the definition of cardinality-compatible 
total ordering -< on 2^1 in (0. For a sequential PE scheme, we 
thus choose the mixing set T from the smallest to the largest 
according to the given cardinality-compatible total ordering. 
The detailed algorithm of choosing T and the target packets 
Xk,j k , k G T, is described as follows. 

There are (2 K — 1) phases and each phase is indexed by 
a non-empty subset T C [K], We move sequentially between 
phases according to the cardinality-compatible total ordering 
-<. That is, if T\ -i, T 2 and there is no other subset T3 satisfying 
T\ ~< T3 -< T2, then after the completion of Phase T\, we move 
to Phase T 2 . 

Consider the operation in Phase T. Recall that the basic 
properties of the PE scheme allow us to choose the target 
packets Xkj k independently for all k £ T. In Phase T, 
consider a fixed k S T. Let Sk = T\k. We first choose a Qk-s k 
packet Xk,j k , i.e., those with S(Xkj k ) = Sk, and keep using 
this packet for transmission, which will be mixed with packets 
from other sessions according to Line [7] of the PE scheme. 
Whenever the current Xk,j k packet evolves (the corresponding 
S(Xk,j k ) changes), we move to the next Qk-,s k packet Xk,j> . 
Continue this process for a pre-defined amount of time slots. 
We use Wk-.s k ^s k to denote the number of time slots in which 
we choose a Qk\S k packet. After Wk-s k ^S k number of time 
slots, we are still in Phase T but we will start to choose 
a different Qk-s k P ac ket Xkj k (i.e., with S(Xk.j k ) = Sk), 
which will be mixed with packets from other sessions in T. 

5 For general 1-to-M PECs with M > 4, we may have the following cyclic 
dependence relationship: Packet mixing in Phase A needs to use the packets 
generated by the packet mixing during Phase B. Packing mixing in Phase B 
needs the packets resulted from the packet mixing during Phase C. But the 
packing mixing of Phase C also needs the packets resulted from the packing 
mixing in Phase A. Quantifying such a cyclic dependence relationship with 
causality constraints is a complicated problem. 



More explicitly, we choose a sequence of Sk such that all Sk 
satisfy Sk Q Sk C ([K]\k), which guarantees that such new 
Xkj k with S(Xkj k ) = Sk is still non-interfering from the 
perspectives of all other sessions in T. The order we choose 
the Sk follows that of the total ordering -<. The closer Sk is 
to Sk, the earlier we use such Sk. 



For any chosen Sk, we choose a Q k .§ P ac ket Xk,j k , i.e., 
those with S(Xk.j k ) = Sk, and keep using this packet to 
generate coded packets for transmission. Whenever the current 
Xk,j k packet evolves (the corresponding S(Xk.j k ) changes), 
we move to the next Q k § k packet Xk.j> k . Continue this process 
for a pre-defined amount of time slots. We use w k .g k ^ Sk to 
denote the number of time slots in which we choose a Qk-s k 
packet. That is, w k .g k ^ Sk is the number of time slots that 
we are using a Qk-s k P ac ket in substitute for a Qk-,s k packet, 
which is similar to the operations in Phases 3.1 to 3.3. After 
w, 5 jC number of time slots, we are still in Phase T but 
we will move to the next eligible Sk according to the total 
ordering -<. Continue this process until all Sk have been used. 



Since we choose the target packet Xkj k independently for 
all k, Phase T thus takes 



XT = wtir ^ ™fc;S-KT\fc) (63) 

US:(T\k)CSC([K]\k) J 



number of time slots to finish. Since we have totally (2 K — 1) 
different phases, it thus takes X)vTe2i A ']'7V0 Xt to finish a U 
the phases. 



For the following, we will show that there exists a feasible 
sequential PE scheme if the choices of {xt '■ VT 6 2^,7^ 
0} and {w k -,s->T ■ Vfc e [A'],VT CSC {[K]\k)} satisfy (gS 



and the following equations: 

xt < n(l — e) for some e > 

Vfc g [A'], iy fc; 0^.0 -p U [A'] = nRk 
VfcG [Jf],V5C ([AT]\fc),S*^0, 



(64) 
(65) 



E w k;S- 
, VTnTiCS 



•Ti Pu([K]\S) 



E 

VSi , Ti : such that 
Ti C Si C ([K]\fc), 
Ti C S, S £ Si 



^Si^Ti-/p((S\7\)([Al\S) 



(66) 



Vfc G [K],S,Te 2^1 satisfying T C S* C ([AT]\fc),T ^ S, 



Wk;S^T 



E 

VTi C S : 
(TiU{fc}) -< (Tu{fe}) 



Wfe;S->Ti 



Rj([AT]\S) 



< 



E u>* ;Sl -,r-/p((5\T)([if]\5)) + 



VSi : Si X s. 
T C Si C ([K]\k) 

E 

VSi . Ti : such that 
Ti C Si C ([if]\fe), 
(TiU{fc}) X (Tu{fc», 
Ti C S, S £ Si 



(67) 

Note that d63t to (|67T i are similar to ([8]l to ( fT2l of the achiev- 
ability inner bound in Proposition [3] The only differences are 
(i) The new scaling factor n in ( l64l i and (l65l l when compared 
to dH) and ( [Tol l; (ii) The use of the max operation in ( |63| > 
when compared to (O; and (iii) The equality "=" in ( l65l l and 
( |66] > instead of the inequality ">" in (TTOb and (fTTT i. The first 
two differences (i) and (ii) are simple restatements and do 
not change the feasibility region. The third difference (iii) can 
be reconciled by sending auxiliary dummy (all-zero) packets 
in the PE scheme as will be clear in the following proof. 
As a result, we focus on proving the existence of a feasible 
sequential PE scheme provided the new inequalities d63l l to 
(loTt are satisfied. 

Assuming sufficiently large n, the law of large numbers 
ensures that all the following discussion are accurate within 
the precision o(n), which is thus ignored for simplicity. (l64l 
implies that we can finish all the phases within n time slots. 
Since each Qk-ti packet Xk.j k in average needs p J time 
slots before its S(Xk.j k ) evolves to another value, d65l l ensures 
that after Phase {fc}, all Qk-.$ packets have been used up and 
evolved to a different Qk-s packet. 

Suppose that we are currently in Phase (TU {fc}) for some 
fc ^ T, and suppose that we just finished choosing the Qk-s' 
packet for some old S' and are in the beginning of choosing a 
new Qk-s packet (with a new S ^ S') that will subsequently 
be mixed with packets from other sessions. By Line [4] each 
Qk-s packet evolves to a different packet if and only if one 



of the d{ with i G ([ATjyS*) receives the coded transmission. 
Therefore, sending Qk-s packets for Wk-s^T number of time 
slots will consume additional Wk-s^-T ■ Pu([K]\s) number of 
Qk-s packets. Similarly, the previous phases (Ti U {fc}) such 
that Ti C S and (Ti U {fc}) < (T U {fc}), have consumed 
totally 



E 



{Wk-S^n ■ Pu([K]\S)) 



VTi C S : 
(Ti U{fc}) X (Tu{t)) 



number of Qk-s packets. The left-hand side of (|67| | thus 
represents the total number of Qk t s packets that have been 
consumed after finishing the Wk-,s^T number of time slots of 
Phase (TU{fc}) sending Qk_s packets. As will be shown short 
after, the right-hand side of d6Tb represents the total number 
of Qk : s packets that have been created until the current time 
slot. As a result, (l67l i corresponds to a packet-conservation 
law that limits the largest number of Qk-,s packets that can be 
used in Phase (T U {fc}). 

To show that the right-hand side of (ISTT i represents the total 
number of Qk : s packets that have been created, we notice 
that the Qk-s packets can either be created within the current 
Phase (T U {fc}) but during the previous attempts of sending 
Qk-S! packets in Phase (TU {fc}) with Si -< S; or be created 
in the previous phases (TiU{fc}) with (TiU{fc}) X (TU{fc}). 
The former case corresponds to the first term on the right-hand 
side of d67] i and the latter case corresponds to the second term 
on the right-hand side of (|67| |. 

For the former case, for each time slot in which we transmit 
a Qfc ; Si packet in Phase (TU {fc}), there is some chance that 
the packet will evolve into a Qk-s packet. More explicitly, 
by Line|4]of the UPDATE, a Qk-Sx packet in Phase (TU {A;}) 
evolves into a Qk-.s packet if and only if the packet is received 
by all di with i G (S\T) and not by any d, with i G ([AT]\S) . 
As a result, each such time slot will create f p ((S\T)([K]\S)) 
number of Qk-,s packet in average. Since we previously sent 
Qk-.Si packets for a total Wk-S!^T number of time slots, the 
first term of the right-hand side of ( f6Tb is indeed the number 
°f Qk-s packets created within the current Phase (T U {fc}) 
but during the previous attempts of sending Qk-.Si packets. 

For the latter case, for each time slot in which we transmit 
a Qfe ; Si packet in Phase (T\ U {fc}), there is some chance 
that the packet will evolve into a Qk-s packet, provided we 
have T C S and S ^ Si. More explicitly, by Line g] of the 
Update, a Qfe ; Si packet in Phase (T U {fc}) evolves into a 
Qk-.s packet if and only if 

'TiQS (TxCS 

S a £S! . , . \S<£Sl 

, , % or equivalently < , ; , 

(S\Ti)CS a 4 j(5\Ti)CS^ 

X[K]\S) C ([K]\S a ) U[K]\S) C ([K]\S a ) 

Therefore, for any (Si,Ti) pair satisfying Ti C S and 
S Si, a Q k - Sl packet in Phase (T U {fc}) will have 
f p ((S\Ti)([K]\S)) probability to evolve into a Qk-.s packet. 
Since we previously sent Qk-Si packets in Phase (Ti U {fc}) 



for a total Wk;Si-yTi number of time slots, the second term 
of the right-hand side of d67l i is indeed the number of Qk-.s 
packets created during the attempts of sending Qk-,Si packets 
in the previous Phase (T\ U {k}). 

Suppose that we are currently in Phase (S U {fc}) for 
some k £ S. To justify (l66l i. we first note that in the 
sequential PE construction we only select the packets Xkj 
with k £ S(Xk,j). By Line|4]of the UPDATE, each packet Xkj 
transmitted in Phase T is either received by the intended desti- 
nation dk, or it will evolve into a new S(Xk.j) that is a proper 
superset of (T\k). As a result, the cardinality-compatible total 
ordering "-<" ensures that once we are in Phase (S U {k}), 
any subsequent Phase T with (S U {k}) -< T will not create 
any new Qk-,s packets. Therefore, if we can clean up all Qk-s 
packets in Phase (SU{k}) for all S C ([K]\k), then in the end 
of the sequential PE scheme, there will be no Qk-s packets 
for any S C ([K]\k). This thus implies that all Xkj packets 
in the end must have S(Xkj) 3 k. By Lemma[3] decodability 
is thus guaranteed. (l66l l is the equation that guarantees that we 
can clean up all Qk-s packets in Phase (S U {k}). 

By similar computation as in the discussion of the right- 
hand side of (l67l i. the right-hand side of (l66l l is the total 
number of Qk-s packets generated during the attempts of 
sending Qk-.Sx packets in the previous Phase (Ti U {k}) with 
(Tt U {k}) -< (S U {k}). Similar to the computation in the 
discussion of the left-hand side of (|67| |. there is 



tion Q] collapses to the following single linear inequality: 



E Wk ' s - 

, VTi:TiCS,Ti^S 



-Ti PU([K]\S) 



(68) 



number of Qk-s packets that have been used during the 
previous Phases (T\ U {k}). In the beginning of this phase, 
we send Qk-s^s packets for Wk-s^s number of time slots, 
which can clean up additional 



Wk-S^S -P\J{[K]\S) 



(69) 



number of Qfe ; s packets. Jointly, d68l l. ( 1691 1, and (f66b ensures 
that we can use up all Qk-s packets in Phase (S U {k}). 

The above reasonings show that we can finish the transmis- 
sion in n time slots, make all Xkj have S(Xkj) 3 k, and 
obey the causality constraints. Therefore, the corresponding 
sequential PE scheme is indeed a feasible solution. The proof 
of Proposition |3] is thus complete. 

D. Attaining The Capacity Of Two Classes of PECs 

In this section, we prove the capacity results for symmetric 
1-to-A' broadcast PECs in Proposition [4] and for spatially in- 
dependent broadcast PECs with one-sided fairness constraints 
in Proposition 

Proof of Proposition 0- Since the broadcast channel is 
symmetric, for any Si,Sz G 2^ K \ we have 



RjSi =Pus 2 if I Si | = \S; 



2 • 



Without loss of generality, also assume that R% > i?2 > • • • > 
Rk- By the above simplification, the outer bound in Proposi- 



Rk 
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kTi 



< l. 



(70) 



We use the results in Proposition [3] to prove that (l70b 
is indeed the capacity region. To that end, we first fix an 
arbitrary cardinality-compatible total ordering. Then for any 

S C (\K]\k), we choose 



Wk-S^S 



Rk' 



K 

E 

i=K-\S\ 
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VSi : |Si| = i 
\ ai<]\S) CSiC [K] 



(-1) 



i-(K-\S\) 



PuSi 



J 



and Wk-s^T = for all T being a proper subset of S. The 
symmetry of the broadcast PEC, the assumption that i?i > 
R 2 > ■ ■ ■ > Rk, and d63l l jointly imply that 

x T = w k ,.(r\k*)^(T\k*) where k * = min{t : i G T} (71) 

for all T ^ 0. For completeness, we set x$ = 0. 

By simple probability arguments as first describee^ in 
Section [V-BI we can show that the above choices of Wk-,s->T 
and xt are all non-negative and jointly satisfy the inequalities 
© to (O. 

The remaining task is to show that inequality ([8]) is satisfied 
for any (i?i, ■•• ,Rk) in the interior of the capacity outer 
bound ( fTOl i. To that end, we simply need to verify the following 
equalities by some simple arithmetic computation. 

Vfc G [K], J! x t 

VTe2 [ A '] : feeT,[fe-i]nT=0 

E Wk;(T\k)^(T\k) 
VTe2[ Jf ]:feST,[fe-l]nT=0 



Rk 
Pu[k] 



(72) 



Summing ( |72l over different k values, we thus show that any 
(Ri, • • • , Rk) in the interior of the capacity outer bound ( T70b 
indeed satisfies (8). The proof of Proposition |4] is complete. 

■ 

Proof of Proposition [3} Consider an arbitrary spatially 
independent broadcast PEC with < p\ < P2 < ■ ■ ■ < pk- 
The capacity outer bound in Proposition Q] implies that any 
achievable rate vector (i? 1; • • • , Rk) must satisfy 
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fc=i 



Rk 



nti(i 



pi) 



< i. 



(73) 



We use the results in Proposition [3] to prove that any one- 
sidedly fair rate vector (i? 1; --- ,Rk) G A os f that is in the 
interior of ( 1731 is indeed achievable. To that end, we first fix 

6 Some detailed discussion can also be found in the proof of Lemma \5\ in 
Appendix [D] 



an arbitrary cardinality-compatible total ordering. Then for any 

S Q ([K]\k), we choose 



WfcrS-S-S = Rk 
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-K-\S\ 
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V5i : = i 
\ ([K]\S) C Si C [if] 



PUSi 
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and Wk-s^T = for all T being a proper subset of 5. By 
Lemma [5] in Appendix [D] and by ((63), we have 

^ = m«K;(T\^(T\fc)) 

= w k *-(T\k')^(T\k') where fc* = min{i : i E T} 

for all T / 0. For completeness, we set X0 = 0. 

The remaining proof of Proposition can be completed by 
following the same steps after (T7TT > of the proof of Proposi- 
tion |4] 



VI. Further Discussion of The Main Results 

We provide some further discussion of the main results in 
this section. In particular, we focus on the accounting overhead 
of the PE schemes, the minimum finite field size of the PE 
schemes, the sum rate performance of asymptotically large 
M values, and numerical evaluations of the outer and inner 
bounds for general 1-to-K broadcast PECs. 

A. Accounting Overhead 

Thus far we assume that the individual destination dk knows 
the global coding vector v tx that is used to generate the 
coded symbols (see Line [10] of the main PE scheme). Since 
the coding vector v tx is generated randomly, this assumption 
generally does not hold, and the coding vector v tx also needs 
to be conveyed to the destinations. Otherwise, destinations dk 
cannot decode the original information symbols Xkj for the 
received coded symbols Zk(t), t G [n]. The cost of sending 
the coding vector v tx is termed the coding overhead or the 
accounting overhead. 

We use the generation-based scheme in J4) to average 
out and absorb the accounting overheard. Namely, we first 
choose sufficiently large n and finite field size q such that the 
PE scheme can achieve (1 — e) -portion of the capacity with 
arbitrarily close-to-one probability when assuming there is no 
accounting overhead. Once the n and q values are fixed, we 
choose an even larger finite field GF(g M+ ^=i nfl fc) for some 
large integer M. The larg^e finite field is then treated as a vector 
of dimension M + 2~2k=i n Rk- Although each information 
symbol (vector) is chosen from Xk.j G GF(q M+ ^&=i "- R *) ; 
we limit the range of the Xk.j vector value such that the first 
SfeLi n Rk coordinates are always zero, i.e., no information is 
carried in the first 2~2k=i n Rk coordinates. We can thus view 
the entire systems as sending M coordinates in each vector. 
During the transmission of the PE scheme, we focus on coding 
over each coordinate, respectively, rather than jointly coding 



over the entire vector. The same coding vector v tx is used 
repeatedly to encode the last M coordinates. And we use the 
first Ylk=i n Rk coordinates to store the coding vector v tx . 

Since only the last M coordinates are used to carry infor- 
mation, overall the transmission rate is reduced by a factor 

A J — . By choosing a sufficiently large M, we have 

averaged out and absorbed the accounting overhead. 

B. Minimum Finite Field Size 

The PE scheme in Section ITVl is presented in the context of 
random linear network coding, which uses a sufficiently large 
finite field size GF(q) and proves that the desired properties 
hold with close-to-one probability. The main advantage of this 
random-coding-based description is that the entire algorithm 
can be carried out in a very efficient and distributed fashion. 
For example, with a sufficiently large q, the source s only 
needs to bookkeep the S(Xkj) and v(Xkj) values of all 
the information packets Xkj. All the coding and update 
computations are of linear complexity. On the other hand, 
the drawback of a randomized algorithm is that even with 
very large GF(g), there is still a small probability that after 
the termination of the PE algorithm, some destination dk 
has not accumulated enough linearly independent packets to 
decode the desired symbols Xk,x to Xk : nR k - For the following, 
we discuss how to covert the randomized PE scheme into 
a deterministic algorithm by quantifying the corresponding 
minimum size of the finite field. 

Proposition 6: Consider the \-lo-K broadcast PEC prob- 
lem with COF. For any fixed finite field GF(q ) satisfying 
go > K, all the achievability results in Propositions [2] [3] 
H] and |5] can be attained by a deterministic PE algorithm on 
GF(qo) that deterministic ally computes the mixing coefficients 
{ck : Vfc G T} in Line |7] of the PE scheme. 

The proof of Proposition |6] is relegated to Appendix |C] 

Remark 1: In practice, the most commonly used finite field 
is GF(2 8 ). Proposition |6] guarantees that GF(2 8 ) is sufficient 
for coding over K < 255 sessions together. 

Remark 2: On the other hand, the construction of good 
mixing coefficients {ck '■ Vfc G T} in Proposition |6] is 
computationally intensive. The randomized PE scheme has 
substantial complexity advantage over the deterministic PE 
scheme. 

C. The Asymptotic Sum-Rate Capacity of Large M Values 

We first define the sum-rate capacity as follows: 
Definition 9: The sum-rate capacity R* um is defined as 



Rt, 



sup < Rk : (Ri, ■ ■ ■ , Rk) is achievable > . 



Proposition [5] quickly implies the following corollary. 

Corollary 3: Consider any spatially independent l-to-if 
broadcast PECs with marginal success probabilities < pi < 
p 2 < . . . < p K < l. With COF, the sum-rate capacity satisfies 

l^k=i i- Pk 



sr- K 1 

^ k=1 (i-P*)(i-n?u(i- 



< Rtum < 1- 



Pi)) 




Fig. 6. The 3-D capacity region of a l-to-3 spatially independent broadcast 
PEC with marginal success probabilities pi = 0.7, P2 = 0.5, and P3 = 0.3. 

If we further enforce perfect fairness, i.e., R\ = R-2 = ■ ■ ■ = 
Rk, then the corresponding sum-rate capacity A*^^ fair 
becomes 

n* = ^ 

-^sum,perf .fair v 

^ k=1 (i-nf =1 (i-pi)) 

Proof: Since the sum-rate capacity nR* um is no larger 
than the total available time slots n, we have the upper bound 
R* < 1. Since the rate vector (t^—, t^—, ■ ■ ■ , r^—) 

sum — ^i—pj! 1— p 2 ' ' 1— Pk J 

is one-sidedly fair, Proposition leads to the lower bound 
of R* um . Since a perfectly fair rate vector (R, R, ■ ■ ■ ,R) is 
also one-sidedly fair, Proposition gives the exact value of 

sum, perf. fair' 

Corollary [3]implies the following. Consider any fixed p > 0. 
Consider a symmetric, spatially independent 1-to-A broadcast 
PEC with marginal success probability pi = p% = ■ ■ ■ = pk = 
p. When A is sufficiently large, both the sum-rate capacities 
R* um and R* um perf fair approach one. That is, for sufficiently 
large A', network coding completely removes all the channel 
uncertainty by taking advantage of the spatial diversity among 
different destinations dj. Therefore, each (s,dk) session can 
sustain rate for some e > where f-)0 when K — > oo. 
Note that when compared to the MIMO capacity gain, the 
setting in this paper is more conservative in a sense that it 
assumes that the channel gains change independently from 
time slot to time slot (instead of block fading) while no 
coordination is allowed among destinations. 

This relationship was first observed and proven in lfT31 
by identifying a lower bound of R* um perf fair for symmetric, 
spatially independent PECs. Compared to the results in lfl5l . 
Corollary [3] characterizes the exact value of A* umperf f a ; r and 
provides a tighter lower bound on R* um for non-symmetric 
spatially independent PECs. The R* um perf fair will later be 
evaluated numerically in Section IVI-DI for non-symmetric 
spatially independent PECs. 

D. Numerical Evaluation 

Fig. [6] illustrates the 3-dimensional capacity region of 
(Ri, i?2, R3) of a spatially independent, l-to-3 broadcast 



PEC with COF. The corresponding marginal probabilities are 
pi = 0.7, P2 = 0.5, and P3 = 0.3. The six facets in 
Fig. [6] correspond to the six different permutations used in 
Proposition Q] 

For general 1-to-K PECs with K > 4, we can use the 
outer and inner bounds in Propositions [T] and [3] to bracket the 
actual capacity region. Since there is no tightness guarantee 
for K > 4 except for the two special classes of channels in 
Section IIII-BI we use computer to numerically evaluate the 
tightness of the outer and inner bound pairs. To that end, 
for any fixed K value, we consider spatially independent 
1-to-A' broadcast PEC with the marginal success probabil- 
ities pk chosen randomly from (0, 1). To capture the K- 
dimensional capacity region, we first choose a search direction 
v = (v±, ■ ■ ■ , vk) uniformly randomly from a A'-dimensional 
unit ball. With the chosen values of pk and v, we use a 
linear programming (LP) solver to find the largest t outer such 
that (i?i, • • • , Rk) = («i • Wr, • • • , v K ■ Ur) satisfies the 
capacity outer bound in Proposition Q] 

To evaluate the capacity inner bound, we need to choose a 
cardinality-compatible total ordering. For any set S C [K], the 
corresponding incidence vector lg is a A'-dimensional binary 
vector with the i-tii coordinate being one if and only if i £ S. 
We can also view I5 as a binary number, where the first 
coordinate is the most significant bit and the K -th coordinate is 
the least significant bit. For example, for K — 4, S — {1, 2, 4} 
has I5 = (1,1,0,1) = 13. For two sets S\ ^ S2, we say 
S\ -< S2 if and only if either (i) |5i| = IS2I an d lsi < ls a > 
or (ii) I Si I < 52 1 . Based on this cardinality-compatible total 
ordering, we again use the LP solver to find the largest U nnel 
such that (R 1: ■ ■ ■ , R K ) = (vi ■ t inner , ■■■ , v K ■ t imel ) satisfies 
the capacity inner bound in Proposition [3] The deficiency is 
then defined as defi = f °"'° ~ ti ° ner . We then repeat the above 
experiment for 10 4 times for K = 4, 5, and 6, respectively. 

Note that although there is no tightness guarantee for K > 4 
except in the one-sidedly fair rate region, all our numerical 
experiments (totally 3 x 10 4 ) have defi < 0.1%. Actually, in 
our experiments with A < 6, we have not found any instance 
of the input parameters (pi, • • • ,Pk) and v, for which defi 
is greater than the numerical precision of the LP solver. This 
shows that Propositions Q] and [3] indeed describe the capacity 
region from the practical perspective. 

To illustrate the broadcast network coding gain, we compare 
the sum-rate capacity versus the sum rate achievable by 
time sharing. Figs. [7] and [8] consider symmetric, spatially 
independent PECs with marginal success probabilities p\ = 
■ ■ ■ = Pk = P- We plot the sum rate capacity -R* umiPer f.f air 
versus p for a perfectly fair system. The baseline is the largest 
sum rate that can be achieved by time sharing for a perfectly 
fair system. As seen in Figs.|7]and[8] the network coding gains 
are substantial even when we only have A' = 4 destinations. 
We also note that A* um perf fair approaches one for all p £ (0, 1] 
as predicted by Corollary [3] 

We are also interested in the sum rate capacity under asym- 
metric channel profiles (also known as heterogeneous channel 
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Fig. 7. The sum-rate capacity R* um perf fair in a perfectly fair system versus the 
marginal success probability p of a symmetric, spatially independent i-to-K 
broadcast PEC, K = 2 and 4. 
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Fig. 8. The sum-rate capacity -R* um per f f a ; r in a perfectly fair system versus the 
marginal success probability p of a symmetric, spatially independent i-to-K 
broadcast PEC, K = 20 and 100. 

profiles). Consider asymmetric, spatially independent PECs. 
For each p value, we let the channel gains p\ to pk be equally 
spaced between (p, 1), i.e., pk = p+ (k — l)jf^. We then plot 
the sum rate capacities for different p values. Fig. [9] describes 
the case for K = 6. The sum rate capacities are depicted 
by solid curves, which is obtained by solving the linear 
inequalities in the outer and inner bounds of Propositions Q] 
and [3] For all the parameter values used to plot Fig. [9] the 
outer and inner bounds meet and we thus have the exact sum 
rate capacities for the case of K = 6. The best achievable 
rate of time sharing are depicted by dashed curves in Fig. [9] 
We consider both a perfectly fair system (R, R,- ■ ■ ,R) or 
a proportionally fair system (piR,p2R, ■ ■ ■ ,PkR) for which 
the rate of the (s,dk) session is proportional to the marginal 
success probability pk (the optimal rate when all other sessions 
are silent). To highlight the impact of channel heterogeneity, 
we also redraw the curves of perfectly symmetric PECs with 
pi = ■ ■ ■ = p K = p. 

As seen in Fig. [9] for perfectly fair systems, the sum-rate 
capacity gain does not increase much when moving from 
symmetric PECs p\ = ■ ■ ■ = pk = p to the heteroge- 
neous channel profile with p\ to pk evenly spaced between 
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Success Prob: p 

Fig. 9. The sum-rate capacities for a 6-destination heterogenous channel 
profiles with the success probabilities p\ to pg evenly spaced between (p, 1). 
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Fig. 10. The sum-rate capacities for a 20-destination heterogenous channel 
profiles with the success probabilities pi to P20 evenly spaced between (p, 1). 



(p, 1). The reason is due to that the worst user di (with the 
smallest pi) dominates the system performance in a perfectly 
fair system. When we allow proportional fairness, network 
coding again provides substantial improvement for all p values. 
However, the gain is not as large as the case of symmetric 
channels. For example, when p\ to pk are evenly spaced 
between (0,1). The sum rate capacity of a proportionally 
fair system is 0.56 {p = 0). However, if all p\ to pk are 
concentrated on their mean 0.5, then the sum rate capacity of 
the symmetric channel (p = 0.5) is 0.79. The results show 
that for practical implementation, it is better to group together 
all the sessions of similar marginal success rates and perform 
intersession network coding within the same group. 

We also repeat the same experiment of Fig. [9] but for the 
case K = 20 in Fig. [10] In this case of a moderate-sized 
K = 20, the sum-rate capacity of a perfectly fair system is 
characterized by Proposition|5] On the other hand, the sum-rate 
capacity of a proportionally fair system are characterized by 
Proposition|5]only when all p\ to pk are in the range of [0.5, 1] 
(see the discussion of one-sidedly fair systems in Section lV-DI >. 
Since the evaluations of both the outer and inner bounds have 
prohibitively high complexity for the case K = 20, we use 



the capacity formula of Proposition [5] as a substitute of the 
sum-rate capacity for p < 0.5, which is illustrated in Fig. [10] 
by the fine dotted extension of the solid curve for the region 
of p G [0.5, 1]. Again, the more sessions (K = 20) to be 
encoded together, the higher the network coding gain over the 
best time sharing rate. 

VII. Conclusion 

The recent development of practical network coding 
schemes J4) has brought attentions back to the study of packet 
erasure channels (PECs), which is a generalization of the 
classic binary erasure channels. Since per-packet feedback 
(such as ARQ) is widely used in today's network protocols, 
it is thus of critical importance to study PECs with channel 
output feedback (COF). This work have focused on deriving 
the capacity of general 1-to-A' broadcast PECs with COF, 
which was previously known only for the case K = 2. 

In this work, we have proposed a new class of intersession 
network coding schemes, termed the packet evolution (PE) 
schemes, for the broadcast PECs. Based on the PE schemes, 
we have derived the capacity region for general l-to-3 broad- 
cast PECs, and a pair of capacity outer and inner bounds 
for general 1-to-A' broadcast PECs, both of which can be 
easily evaluated by any linear programming solver for the 
cases K < 6. It has also been proven that the outer and 
inner bounds meet for two classes of 1-to-A" broadcast PECs: 
the symmetric broadcast PECs, and the spatially independent 
broadcast PECs with the one-sided fairness rate constraints. 
Extensive numerical experiments have shown that the outer 
and inner bounds meet for almost all broadcast PECs encoun- 
tered in practical scenarios. Therefore, we can effectively use 
the outer/inner bounds as the substitute for the capacity region 
in practical applications. The capacity results in this paper also 
show that for large K values, the noise of the broadcast PECs 
can be effectively removed by exploiting the inherent spatial 
diversity of the system, even without any coordination between 
the destinations. 

For practical implementation, the COF usually arrives in 
batches. That is, instead of instant per-packet COF, we usually 
have periodic, per-batch COF. The PE scheme can be modi- 
fied to incorporate periodic COF as well. The corresponding 
discussion and some precursory empirical implementation of 
the revised PE scheme can be found in lfl4"l . 
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Appendix A 
A Proof Of Lemma[2] 

Proof: We prove Lemma [2] by induction. First consider 
the end of the 0-th time slot (before any transmission). 

7 When all pi to pk are in [0.5, 1], the formula in Proposition [5] describes 
the capacity. When some pi to pk is outside [0.5, 1], the formula in 
Proposition \5\ describes an outer bound of the capacity. 



Since So(X k j) = for all X k) j and the only di satisfying 
i G (So(Xkj) U {k}) is d k , we only need to check whether 
Vo(Xk.j) is in the linear space span(Oz.fc(0), f2j\/.fe)- Note 
that in the end of time 0, Vo(X k j) is the elementary vector 
Sk,j G &M.k- Lemma |2] thus holds in the end of time 0. 

Suppose Lemma |2] is satisfied in the end of time (t — 1). 
Consider the end of time t. We use T to denote the subset 
chosen in the beginning of time t and use {X k j k : Vfc G 
T} to denote the corresponding target packets. Consider the 
following cases: 

Case 1: Consider those X k) j k such that St(X k ,j k ) = 
S t -i(X k , Jk ). We first note that if Line @] of the UPDATE is 
executed, then St(X k j k ) St-i(X k ,j k )■ Therefore, for those 
X k ,j k such that S t (X k ,j k ) = S t -i(X kJk ), we must have that 
Lines [4] and [5] of the UPDATE are not executed, which implies 
that v t ( x k,j h ) =vt-i(X ktjk ). 

By definition, flz,i(t — 1) Q &z,i(t) for all i G [K] and 
t G [n]. By the induction assumption, we thus have that for 
all d t with i G (S t (X k ^ k ) U {k}) = (S t -i(X k<jh )U {k}), 

G span(fi Zil (i - 1), C span(Q Zii (t), Q M ,i)- 

Vector v t (X k j k ) is thus non-interfering from the perspectives 
of all d h i G \s t (X kJk )U{k}). 

Case 2: Consider those X k i ji that are not a target packet. 
Since those packets do not participate in time t and their 
S(Xk'.j') and v(Xk>j>) do not change from time (t — 1) to 
time t. The same arguments of Case 1 hold verbatim for this 
case. 

Case 3: Consider those target packets X k j k such that 
S t (X k j k ) ^ S t -i(X k j k ). For those target packets X k>jk 
with S t (X k ,j k ) ^ S t -i(Xk,j k ), we must have S t (X k j k ) = 
(T n St-i(X k j h )) U S IX and v t {X k , j k ) - v tx by Lines S 
and [5] of the UPDATE, respectively. Consider any di such that 
i G (St(X k j k ) U {k}). We have two subcases: Case 3.1: 
i G S rx . Since all such di must explicitly receive the new 
v t(X k .j k ) — v tx in the end of time t, we must have 

Vt(Xk,j h ) G span(v tx ) = span(Zj(t)) 

C Slz,i(t) C span(ilz,i(t),ilM,i)- 

Such Vj (X k j k ) is thus non-interfering from d,'s perspective. 
Case 3.2: i G (S t (X kdk ) U {fc})\5 lx . We first notice that 

S t (X k , jk ) U {k} = (T n S t _i(X fcJ J) U S a U {k} 

= ((TU{k})n(S t - 1 (X k , jk )U{k}))US a 
= (Tn(S f _,ft Jt )U{fc}))US IX (74) 
= TUS rx , (75) 

where d74l i follows from that k G T since X k j k is a target 
packet. d23 follows from that (S t -i(X kJk ) U {k}) D T by 
Line [6] of the main structure of the PE scheme. From ( l75l l. the 
i value in this case must satisfy 

i G (S t (X ktjk ) U {k})\S rx = (T U S a )\S a = T\S a . (76) 

Also by Line [6] of the main structure of the PE scheme, for all 
i satisfy {76} we must have i G (T\S a ) CTC (S' t _ 1 (A";. Ji )U 



{I}) for all I G T. By induction, the Vt-ipQj,) vectors used 
to generate the new v tx (totally \T\ of them) must all be non- 
interfering from di's perspective. Therefore 

V/ 6 T, v t _i(* IiA ) G span(fi z ,(t - l),f2 Mji ) (77) 
= span(Qz,i(t),0, M ,i), 

where the last equality follows from that di, i G T\S„s., 
does not receive any packet in time t. Since v tx is a linear 
combination of v t _i(Xij l ) for all I G T, we thus have 

v t (X kJk ) = v tx G span(fi Z)i (i),fi M ,»)- 

Based on the above reasoning, v t (X k j k ) is non-interfering for 
all di with i G (S t (X ktjk )U {k})\S a . 

The proof is completed by induction on the time index t. 

■ 

Appendix B 
A Proof Of Lemma[3] 

Proof of Lemma [3} We prove this lemma by induction 
on time t. In the end of time t = 0, since 

n R:k (0) = span(v (X fcj ) : Vj G [nR k ], k £ S (X k ,j) = 0) 

= span (6 k j : Vfc G [K],j G [nRk]) = &M,k, 

We thus have 

Prob (span(fi z , fc (0), 0^(0)) = span(fi z>fe (0), fi M ,fc)) = 1. 

Lemma [3] is satisfied. 

Consider the end of time t > 0. By induction, the following 
event is of close-to-one probability: 

span(fi z> fc(t-l),nj l ,fc(t-l)) 
= span(f2z, fc (t-l),fi Mj fc). (78) 

The following proofs are conditioned on the event that d78l l is 
satisfied. 

We use T to denote the subset chosen in the beginning of 
time t and use {X k j k } to denote the corresponding target 
packets. Consider the following cases: 

Case 1: Consider those k G T such that the corresponding 
target packet X k j k either has St{X k> j k ) = S t -i(X k j k ) or 
has k G St-i(X k j k ). For the former subcase St(X k j k ) = 
St-i(X kt j k ), by Line 0] of the UPDATE, we must have 
v t (X k j k ) = v t -i(X ktjk ). Since X ktjk is the only packet 
among {X k j : Vj G [nR k ]} that participate in time t, for 
which the corresponding v(X k j) coding vector may change, 
we must have v t (X k j) = Vt-i(X k A for all j G [nR k ]. We 
then have 

Q, R , k (t) = span(v t (X fc ,.,) : Vj e [nR k ],k £ S t (X k ,j) 

= span(v t _i : Vj G [nR k ], k £ S t -i(X ktj )) 

= fijyfe(t-l). (79) 

We note that for the latter subcase k G St-i(X k j k ), we must 
have T C (S t -i(X k j h ) U {fc}) = S t -i(X kjk ) by Line i of 
the main PE scheme. Therefore Line|4]of the UPDATE implies 
that k G St(X k .j k ) as well. Since the remaining space Sink 



only counts the vectors v(X k j) with k £ S(X k .j), d79l holds 
for the latter subcase as well. For both subcases, let w k (t) 
denote the corresponding coding vector of Z k (t), which may 
or may not be an erasure. We then have 

span(Oz jfe (t),Oi ljfe (t)) 
= span(w fc (», £l z ,k(t - 1), &R,k(t)) 
= span(w fe (i), (l z>k (t - 1), Q fi , fc (t - 1)) 
= span(w fe (i), f! z , fc (t - 1), M)fc ) (80) 
= span(0^ fe (i),OM,fe), 

where ( f80l > is obtained by the induction condition d78] >. 
Lemma [3] thus holds for the fc values satisfying Case 1 . 

Case 2: Consider those di with I £ T. Since no Xi.j packets 
participate in time t and their S(Xi i j) and v(Xij) do not 
change in time t. The same arguments of Case 1 thus hold 
verbatim for this case. 

Case 3: Consider those k G T such that the corresponding 
target packet X kJk has S t (X kJk ) ^ S t ^i(X kJk ) and k £ 
S t -i(X k j k ). Define tt' R as 

fl' R = span{v t _i(X fej ) : Vj G [nR k }\j k ,k £ S t -i(X kJ )}. 

(81) 

Note that the conditions of Case 3 and (l8TT l jointly imply that 
^R,k(t— 1) = span(v f _i (X k j k ), r2jj). We have two subcases 
Case 3.1: k £ S t {X k , 3k ) and Case 3.2: k G S t (X kijk ). 

Case 3.1: fc ^ S t (X k j k ). By Line @] of the Update, 
we have k £ S IX , i.e., d k receives an erasure in time t. 
Therefore £l Z}k (t) = ^z,k(t — 1). We will first show that 
span(n z , k {t),nR,k(t)) C span(n z , k {t),tt M ,k)- 

Since the target d fc satisfies k G T C (5 t _ 1 (X ;ji ) U {/}), 
for all I G T, by Lemma [2] all those vt-i^-X^,) are non- 
interfering from life's perspective. That is, 

VI G T, v t -i(X hjl ) G span(O z . fe (i - l),0 M ,fe) 

= span(O z , fe (t),OM,fc). (82) 

As a result, we have v t (A" fc jJ = v tx G span(O z . fc (t), Ojvf.fc) 
since v tx is a linear combination of all v t -i(Xij l ) for all 
Z G T. Therefore, we have 

span (fiz,fc(t),Q.R,fc(£)) 
= span (ft Zife (t),v t (X fcijfc ),fi^) 
C span(fJ Zit (i), f2 z ,i(t), ^M,fc, 
= span(O z , fe (t),Q M , fe ,Q' i j). (83) 

Since we condition on the event that (l78l l holds, we have 

span(n z , k (t),n' R ) C span (fi^ fc (i), v t _i(X fe . jfc ), fi^) 
= span (O z ,fe(t), R ,fe(i - 1)) 

= span(fiz,jk(t-l),nji,fc(*-l)) 

= span (il z , k {t - 1), fi M ,fc) 

= span(fi z>fe (t),fi M ,fc). (84) 

Joint ([83li and (|84j» show that span (0 Zife (t), Q,R tk (t)) C 
span(rj z ^(t),fi M ,fc)- 



To prove Lemma [3] for Case 3.1, it remains to show that the 
event span (Slz,k(t), ^R,k{t)) 2 span(fi Z)fc (i), 0,M,k) is of 
close-to-one probability, conditioning on ( 1781 being true. We 
consider two subcases: depending on whether the following 
equation is satisfied. 

v t _i(x fciJ -j g span (n Zjfc (i - i),n' R ) 

= span(n Zifc (t),f2 / fl ). (85) 

Case 3.1.1: If (f85T > is satisfied, then we have 

span {Cl z ,k(t),^R,k(t)) 
= span (tt Ztk (t),v t (Xk,j k ),£l R ) 
2 span (Q,z,k(i),to'n) 

= span(^ fc (t),vt_ 1 (X fcj J,ri , /i ) (86) 

= span (Slz,k(t), ^R,k(t - 1)) 

= span (fiz )fc (t-l), -1)) 

= span(O z , fc (i-l),0 Mife ) (87) 

= span(O z , fc (i),n Afife ), (88) 

where d86t follows from (ISBT l. and d87l i follows from the 
induction condition ( |78] >. 

Case 3.1.2: dSBT l is not satisfied. By the equality between 
(l86l l and d88), we have 

span (n z ,k{t),v t -i{X k j k ),n' B ) = span(Oz jfe (t),OM,fe)- 

(89) 

Recall that \r t (X k j k ) = v tx is a linear combination of 
Vt-ipQj,) satisfying in (l82l . By ( l89l and the assumption that 
(|8~5| > is not satisfied, we thus have that each ■vt-i(Xij l ) can be 
written as a unique linear combination of av t -i(X k ,j k ) + w 
where a is a GF(g) coefficient and w is a vector satisfying 
w G span (ftz,k(t), &' R )- By the same reasoning, we can 
rewrite v t (X k ,j k ) as 

VtOXfc,jJ = CfcV t _i(X fc)J -J+ ^ QV f _i(X; J; ) 

V(GT\fe 

= CfeVt-^XfejJ + (av t -i(X kdk )+w) 

= (c fe + a)v t _i(X fc , i J+w. (90) 

where a is a GF(q) coefficient, w is a vector satisfying w G 
span (Slz,k{t)> an d the values of a and w depend on the 
random coefficients q for all I ^ k. As a result, we have 

span (Oz,fc(i),fiii,fc(t)) 
= span (fi Zjfc (t),v t pr fc ,.,-J,Q#) 
= span (0 Zfe (t), ((c fe + a)v t -i(Xk !jk ) + w),Cl' R ) . 

Since d85l l is not satisfied and w G span (il Zyk (t), we 
have 

span (Qz,k(t), ((c fc + a)v t -i(X k>jk ) + w) ,Cl' R ) 
= span(n z ,k(t),^t-i(Xk,j k ),^' R ) 
= span(Q z , fe (t),0 M)fc ) (91) 

if and only if (c k + a) ^ 0. Since c k is uniformly distributed 
in GF(q) and the random variables c k and a are independent, 



the event that (l9lT > is true has the conditional probability 
conditioning on (l78T l being true. For sufficiently large q values, 
the conditional probability approaches one. 

Case 3.2: k G S t (X kJk ). Recall that for Case 3, we 
consider those k such that k £ St-i(X k j k ). By Line|4]of the 
Update, we have k G S IX , i.e., d k receives the transmitted 
packet perfectly in time t. Therefore, in the end of time t, 
&Rk(t) — ^'r> which was first defined in < f8TT >. 

We consider two subcases: depending on whether the fol- 
lowing equation is satisfied. 

v t _ 1 (jr fc)i jGspan(n Zlfc (t-l),ni i ). (92) 

Case 3.2.1: If (l92l is satisfied, then we have 

span (Cl Zl k(t), &R,k(t)) = span (£lz,k(t), Mr) 
= span (v t (X kijh ), Q z>k {t - 1), n' R ) 
= span(v t (X fcJ J,O z , fc (i-l),v t _ 1 (X fcJ J,0^) (93) 
= span(v t (X feJ J,O z , fc (i-l),O fljfc (i-l)) 
= span(v t (X fc , jfc ),fi Zjfe (* - l),OM,fc) (94) 
= span(Q z , fc (t),f2jvf,fc), 

where d93l follows from (l92l . and d94l i follows from the 
induction assumption (l78l . 

Case 3.2.2: (|92l is not satisfied. By the induction assumption 
(|78l , we have 

span (Q z , k {t - 1), v t _i(X fcjJ - fe ), 0^) 

= span(Q z , fc (i-l),ft M ,fc)- (95) 

Since the target d fc satisfies fc G T C (S' t _ 1 (X ;j; ) U {/}), 
for all ( 6 T, by Lemma [2] all those v t _i(X; i:/i ) are non- 
interfering from dfc's perspective. That is, 

V/ G T, v t _i(X Wl ) G span(O z , fe (t - l),0 Mjfc ). (96) 

By (1951 . (l96l ). and the assumption that d92l is not satisfied, 
each v t _i(X; j:)i ) can thus be written as a unique linear combi- 
nation of av t _i(XfejJ+w where a is a GF(g) coefficient and 
w is a vector satisfying w G span (Q Zyk (t — l),Cl' R ). Since 
v t(X k .j k ) = v tx is a linear combination of v t _i(X; Ji ), by 
the same reasoning, we can rewrite v t (X k j k ) as 

v t {X k , jk ) = c k v t -i(Xk,j k ) + civ t -i{Xi >k ) 

Vl£T\k 

= c fe v 4 _i(X fcii J + (av t -i{X k , jk ) + w) 

= {c k +a)v t - 1 {X kdk )+w. (97) 

where a is a GF(g) coefficient, w is a vector satisfying w G 
span (Q Zt k(t — 1), Q' R ), and the values of a and w depend on 
the random coefficients c; for all I ^ k. As a result, we have 

span (Q z>k (t), flR, k (tj) = span (Q z ,k(t), il R ) 
= span (v t (X kJk ),n Z!k (t - 1), n^) (98) 
= span (v t (JT fc)i J, ttz, fc (< - 1), v t _i (X fe . Jfc ), fi^) (99) 
= span(v t (X fc , i J,O z , fe (i-l),fi Rife (i-l)) 
= span (vt{X kijk ),Q Z)k (t - l),Q M ,k) 
= span (f2z,fc(t),0 Mjfe ) , 



where the equality from < f98l > to ( f99b is true if and only if the 
(cfc + a) in d97l ) is not zero, since d92l is not satisfied and 
w e span (Qz,k(t - 

Since is uniformly distributed in GF(g) and the ran- 
dom variables c k and a are independent, the event that 

span (Slz,k(t)>ftR,k(t)) = s P an i^z,k(t), &r,m) has the con " 
ditional probability SLzi > conditioning on d78l l being true. 
For sufficiently large q values, the conditional probability 
approaches one. 

Combining all cases: Let At denote the event that 
span(fiz ) k(i),fifl,jfe(t)) = span(f2^,fe(t), Oiw.fe) and let T 
denote the target set chosen in time t. Since for Cases 3.1.2 
and 3.2.2 the conditional probability of At given At-i is 
lower bounded by and for all other cases the conditional 
probability is one, the discussion of Cases 1 to 3.2 thus proves 
the following inequalities: 



1 



Prob(A|A-i,T) > 1- 



Since for any T C [K] we must have \T\ < K, we then have 



Prob(A|A-i) > 1 - - 

q 



K 



By concatenating the conditional probabilities, we thus have 
Prob (span(Q z ,k(i),^R,k(t)) = span(Q z ,k(t),^M,k)) 



> I 1 - - 

q 



IK 



> I 1 - - 

q 



nK 



(100) 



As a result, for any fixed K and n values, we can choose a 
sufficiently large finite field GF(g) such that (1 100b approaches 
one. Lemma [3] thus holds for all fc G [K] and t € [n]. 



Appendix C 
A Proof Of Proposition |6] 

Proof of Proposition® To prove this proposition, we will 
show that for any go > K, the source s can always compute 
the mixing coefficients {c k : Vfc G T} in Line [7] of the PE 
scheme, such that the key properties in Lemmas [2] and [3] hold 
with probability one. Then for any PE scheme, we can use 
the computed mixing coefficients {c k : Vfc G T} instead of 
the randomly chosen ones, while attaining the same desired 
throughput performance. 

We first notice that the proof of Lemma [2] does not involve 
any probabilistic arguments. Therefore, Lemma [2] holds for 
any choices of the mixing coefficients with probability one. 

We use induction to prove that when using carefully com- 
puted mixing coefficients {c k ■ Vfc G T}, Lemma [3] holds 
with probability one. We use the same notation of St(X k .j), 
v t (Xk,j), ^Rjc(i), fiz,k(*), fW,fc. &'r as defined in Lemma[3] 
and its proof|3 

8 We note that in jSU actually depends on the value of k and the time 
index (t — 1). 



In the end of time t = 0, since 

Q R ,k(0) = span(v (X fcJ ) : Vj G [nR k ], k t S (X ktj ) = 0) 
= span (4 d : Vfc G [K] , j G [nR k ]) = fl M ,k, 

we have 

Prob(span(fi^ fe (0),O fi)fc (0)) = span(Q Z)fc (0), M , fe )) = 1. 

Lemma |3] holds with probability one for any finite field 
GF(go). 

Assume that in the end of time (t— 1), Lemma [3] holds with 
probability one. Suppose T is chosen in the beginning of time 
t. Define B t as the set of fc values satisfying: 

Bt = {Vfc G T :k i S t -i(X kJk ) and 

vt-i(Xkj h ) i span(n z ,ik(t-l) > nfl')}. 

Note that this _B t can be computed in the beginning of time 
t. Once _B f is computed, we would like to choose the mixing 
coefficients {q : VI G T} such that the following equation is 
satisfied. 



Vfc G B t , Vtx 



W-t^ij,; 



= c fc v t _i(X fejfc ) + ^ 
vieT\k 



ci\ t _x(X lA ) 

(101) 



Note that for any fc G B t , we have Vt-i{X k j k ) £ 
span(0^.fc(t— 1), Qri). Therefore if we choose the coefficients 
{ci : VI G T} uniformly randomly, the probability that 

c k v t -i(X ktjk ) + c iVt-i{X Ljl ) 

VleT\k 

Gspan(n z , fc (t-l),njy) (102) 



go ' 



is at most The probability that there is at least one fc G T 



\B t \ 
90 



< — . For any 

90 K, 



satisfying (1102b has probability at most 
qo > K, we thus have a non-zero probability > (1 — j^) such 
that the uniformly random choice of {c; : VI G T} will satisfy 
( I10U . Therefore, there must exist at least one {c/ : Vi G T} 
satisfying d 1 1 b . In the beginning of time t, we arbitrarily 
choose any such mixing coefficients {c; : VZ G T} that satisfy 
JTOTV 

The remaining task is to show that the above construction 
of {c k : Vfc G T} guarantees that Lemma[3]holds in the end of 
time t with probability one, regardless the channel realization 
of time t. 

For those fc ^ T, such fc falls into Case 2 of the proof of 
Lemma [3] Since Case 2 holds with probability one, Lemma [3] 
is true for those fc ^ T with probability one. For those fc G T 
and fc G St-i(X k j k ), then such fc falls into Case 1 of the 
proof of Lemma [3] Since Case 1 holds with probability one, 
Lemma [3] is true for those fc G T and fc G St-i(X kt j k ) with 
probability one. 

For those fc satisfying: fc G T, fc ^ St-i(X kt j k ), and 
Vt-i (X k j k ) G span(S!,2,/c(£ — 1),^R')> sucn ^ must fall i nt0 



Case 1, Case 3.1.1, or Case 3.2.1, depending on whether 
S t (X k ,j k ) = S t -i{X k . Jk ) and whether k G S t {X k ,j k ), respec- 
tively. Since Cases 1, 3.1.1, and 3.2.1 hold with probability 
one, Lemma [3] is true for those k with probability one. 

The remaining fc's to consider are those k G B t . If the 
random channel realization leads to St(Xkj k ) = St-i(Xk .j k ), 
then by Case 1 of the proof of Lemma [3] we must have 
Lemma|3]holds with conditional probability one. If the random 
channel realization leads to S t (Xkj k ) ^ S t -i(Xk j k ) and 
k £ St(Xk j k ), then we are in Case 3.1.2. Since for those 
k G B t we have chosen the mixing coefficients {cj : VI G T} 
satisfying ( 11011 . following the same arguments as in (|90T > we 
must be able to rewrite v t (Xk j k ) as follows. 

M X k,j k ) = vtx 

= (c fc + a)v t -\(X k ,j k ) + w 

where (cu + a) is a non-zero GF(q) coefficient, and w is a 
vector satisfying 

w e span (Tt z ,k(t - 1), Cl' R ) = span (Cl z ,k(t), tt' R ) ■ 

Following the same proof of Case 3.1.2 of Lemma [3] we must 
have Lemma [3] holds with conditional probability one. If the 
random channel realization leads to S t (Xkj k ) ^ S t -i{Xkj k ) 
and k G St(Xkj k ), then we are in Case 3.2.2. Since for those 
k G Bt we have chosen the mixing coefficients {a : VI G T} 
satisfying (1101b . following the same arguments as in (|97| | we 
must be able to rewrite v t (Xk.j k ) as follows. 

M X k,j k ) = v tx 

= (c fe + a)v t -i(X kt j k ) + w' 

where (cfc + a) is a non-zero GF(q) coefficient, and w' is a 
vector satisfying w' G span (Qz,k(t — 1), Q' R )- Following the 
same proof of Case 3.2.2 of Lemma[3] we must have Lemma[3] 
holds with conditional probability one. Since regardless of the 
random channel realization, Lemma [3] holds with probability 
one, we have thus shown that one can always construct the 
desired mixing coefficients {c; : VI £ T) provided the finite 
field GF(qo) satisfying qo > A'. By induction on t, the proof 
is complete. 



Appendix D 

A Key Lemma For The Proof Of Proposition \5\ 

Consider an arbitrary spatially independent 1-to-A' broad- 
cast PEC with marginal success probabilities < pi < p2 < 
■ • • < Pk- For any S C [K] and S ^ [K], define 

/ \ 

/jY-Cfc-isi: 



L s = 



K 

E 

i=K-\S\ 



E 



V5i : | Si | = l 
\ (\K]\S) C Si C [K] 



Pusi 



We then have the following lemma: 

Lemma 5: Suppose the l-to-A' broadcast PEC is spa- 
tially independent with marginal success probabilities < 
Pi < • • • < Pk- Consider any one-sidedly fair rate vector 



(Ri,--- ,Rk) G A os f, and any non-empty subset T C [K\. 
For any ki, G T with k\ < k2, we have 



Rk! ■ L T \ kl > Rk 2 



J T\k 2 



Proof: Consider K independent geometric random vari- 
ables Xi to Xk with success probability p\ to pk ■ That is, 
the probability mass function Fk(t) of any Xk satisfies 



F fc (t) = Prob(A fc =t) =Pk{l-Pk) t -\ 

for all strictly positive integer t. For the sake of simplicity, 
here we omit the discussion of the degenerate case in which 
Pk = 1. We say that the geometric random trial Xk is finished 
at time t if X k = t. For any S C [K] and S ^ [K], define 
three random variables 

Y [K] \ S = mm(Xi : i G [K]\S) (103) 
W s = max(X; :teS) (104) 
Ts = Y [K] \ S - mm(Y [K] \ s , W s ). (105) 
Intermediate Step 1: We will first show that 
L S = E{T S }. 

To that end, for any time t, we mark time t by a set I t = {i G 
[K] : Xi < t}. We then have 



Ts = Y [K]\s ~ n)ia(Yjjf]\ S) Ws) = ^2 1 l I t=s}- 
By noting that 



/=i 



t < Y, 



[K]\S 



It.cs, 



we also have 



Y 



[K]\S 



Ei 

t=i 



{t<Y [K]ys } 



= E 1 { / *es}= E Ts '- 



t=i 



VS'iS'CS 



(106) 



Taking the expectation of ( 1106b . we then have 



i 



VS':S'CS 



PU([K]\S) 

(107) 

Solving the simultaneous equations (11071 ). we have 



K 



E{r S '}= E 

i=K-\S'\ 



E 



(-i) 



i-(K-\S'\) 



\ 



VSi : |Si| = l 
\ {[K]\S>) C Si C [A'] 



Pusi 



for all S' C [AT] and 5' ^ [A]. 

Intermediate Step 2: We will show that for any non-empty 
subset T C [A'] and any gT with fci < fcg, we have 



I 



T\fci 



> 



\Pfci 



L T\k 2 
1 -Pfe 



(108) 



For any realization (X%,--- ,Xk) = {x\,--- ,xk), we use 
V[K]\s> W S> an d 7s t° denote the corresponding values of 
Y [K]\s, W s , and T s according to ([T03l (fl04l and lfl05t . 
respectively. We then have 



V(xi,--- -x/s:) k—1 



K 



e 7n fel n^^)- < io9 > 

V(a:i,— ,x K ):j T \ kl >0 fc=l 

Note that the only difference between EjlY^} and 
E{ry\j. fc } is the underlying measures of and X k2 . 

Therefore, by the change of measure formula, we have 

E { T T\k.2 } = E 7T \ fe i ' 

V(aji,— ,iK):iT\t, >0 



^k(£fci) F kl {x k2 ) 



K 



\F k (x k ). (110) 



fe=i 



Note that when jT\ kl > 0, we must have y([/<-]\T)u{fci} > 
WT\ kl , which in turn implies that x kl > x k . 2 + 1. We then 
have 

F k2 (x kl ) F kl (x k2 ) = p k2 {l -p k2 ) Xk i p kl (l - p kl ) Xk i 
F kl (x kl ) F k2 (x k2 ) ~ p kl (l -p kl ) Xkl Pfc 2 (l ~p k2 ) Xk2 
'l- Pk2 ^ Xkl ~ Xk2 




(111) 

where the last inequality follows from p kl < p k2 and x kl > 
x k . 2 + 1. Combining ( 11091 l, (1110b . and ( lllll i. we thus have 

which implies ( |108b . 

Final Step 3: Since (R±, • • • , Rk) S A ost , by the definition 
of one-sided fairness, we have 



(112) 



Multiplying (|1081 l and (11 12b together, the proof of Lemma [5] 
is complete. 

■ 
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